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PREFACE 


NUMEROUS  changes,  in  the  form  of  additions,  omissions,  and  altera- 
tions, have  been  made  in  this  edition.  Chapters  I,  V  and  VI  remain 
with  little  alteration,  Chapters  III  and  VIII  have  been  expanded  and 
altered,  while  Chapters  II,  IV,  VII,  IX  and  X  are  entirely  or  almost 
entirely  new.  These  changes  are  wholly  the  work  of  Professor  Godfrey 
H.  Thomson,  and  the  merit  of  the  very  great  improvement  in  the  book 
is  due  to  him.  Owing  to  my  own  time  during  the  War  being  taken  up 
entirely  with  army  medical  work,  I  have  been  unable  to  take  any  part 
in  the  further  development  of  correlational  psychology  since  1914,  and 
it  was  therefore  a  very  great  relief  to  me  when  Professor  Thomson 
kindly  offered  to  cooperate  in  bringing  out  a  second  edition  of  Mental 
Measurement.  No  one  could  have  been  more  fitted  for  this  task,  both 
on  the  mathematical  and  on  the  psychological  side,  and  he  has  produced 
what  is  in  half  its  extent  a  new  book,  vastly  superior  to  the  old.  Some 
of  the  sections  are  purely  mathematical,  and  these  will  perhaps  appeal 
to  a  wider  circle  of  readers. 

As  this  second  edition,  like  the  first,  is  opposed  to  the  theories  of 
Professor  C.  Spearman,  I  wish  to  take  the  present  opportunity  of  saying 
that  as  an  opponent  I  have  learnt  to  respect  and  admire  his  work  to 
a  very  great  degree,  and  looking  back  dispassionately  over  six  years 
spent  in  other  forms  of  psychological  research,  I  find  myself  more  con- 
vinced than  ever  that  his  work  in  correlational  psychology  is  epoch- 
making  in  its  significance.  Although  I  have  found  no  occasion  to 
withdraw  any  of  my  own  earlier  criticisms,  I  have  gladly  taken  the 
opportunity  of  modifying  the  wording  in  many  respects,  in  order  to  do 
better  justice  to  the  outstanding  importance  of  his  views.  Whatever 
may  be  the  ultimate  verdict  on  the  significance  of  "hierarchical  order" 
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in  groups  of  correlation  coefficients,  Professor  Spearman's  correlational 
work  in  other  directions  would  alone  justify  his  unique  position  in  the 
domain  of  statistical  psychology. 

It  has  seemed  to  me  to  be  essential  that  Professor  Thomson  should 
contribute  a  separate  preface,  since  so  much  of  this  edition  is  concerned 
with  the  vindication  of  a  theory  which  originated  from  his  brain  alone. 
Should  a  third  edition  be  called  for  later  on,  I  hope  to  be  able  to  add 
a  "Part  III,"  summarising  results  and  theories  in  non-mathematical 
language.  But  at  present  the  battle  rages  so  hotly  that  this  is  not 
desirable,  even  if  it  were  possible. 

W.  B. 

KING'S  COLLEGE, 
LONDON. 

August,  1920. 


n 

Dr  Brown's  suggestion  that  we  should  write  separate  prefaces  to 
this  book  appeals  to  me  for  one  reason  only,  that  it  enables  me  to 
shoulder,  myself,  the  responsibility  for  any  errors  either  in  principles 
or  in  detail  which  this  edition  may  contain.  Any  credit  I  am  glad  to 
share,  but  the  faults  are  my  own. 

The  book  is  necessarily  concerned  largely  with  controversial  matters. 
On  one  of  these,  the  question  of  the  validity  of  the  reasoning  which 
Professor  Spearman  has  based  on  the  occurrence  of  hierarchical  order, 
I  am  entirely  convinced  that  the  situation  is  as  described  in  Chapters  IX 
and  X.  My  position  is  that  hierarchical  order  is  the  natural  order  among 
correlation  coefficients,  that  it  only  expresses  the  well-known  fact  that 
correlation  coefficients  are  themselves  correlated,  and  that  the  degree 
of  perfection  of  hierarchical  order  found  among  psychological  correlation 
coefficients  is  merely  that  which  occurs  by  chance,  and  not,  as  Professor 
Spearman  has  been  led  to  believe,  extraordinarily  high. 

On  this  point,  then,  I  feel  certain  that  Professor  Spearman  has  drawn 
over-hasty  conclusions,  conclusions  which  may  be,  by  a  fine  instinct, 
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the  true  conclusions,  but  conclusions  which  are  totally  unsupported  by 
the  "hierarchy"  argument. 

On  a  second  point  at  issue,  however,  namely  the  question  of  the  use 
of  ranks  instead  of  measurements  in  calculating  correlations,  I  feel  much 
less  competent  to  speak.  There  is  a  good  deal  to  be  said  on  both  sides. 
The  crux  of  the  question  to  my  mind  is  this.  Admittedly  the  form  of 
distribution  of  any  set  of  scores  in  a  mental  test  is  not  necessarily 
identical  with  the  distribution  of  the  true  scores  in  mental  units.  If 
we  give  up  the  actual  scores  and  merely  retain  the  order  of  merit  we 
are,  it  would  seem,  only  giving  up  a  false  claim  to  accuracy  and  re- 
stricting ourselves  to  what  we  are  certain  of.  But  when  we  come  to 
perform  the  actual  calculations  we  find  that  we  are  again  assuming  a 
form  of  distribution  of  the  mental  quantity,  usually  a  normal  distri- 
bution, but  sometimes  some  other,  as  in  the  "Foot-rule."  It  comes 
therefore  to  this,  that  in  calculating  rank-correlations  we  are  refusing 
to  take  the  distribution  found  and  are  substituting  for  it  one  not  found. 
The  question  then  is  whether  it  is  more  probable  that  the  assumed  or 
the  measured  distribution  corresponds  to  the  actual  unknown  mental 
distribution.  I  think  that  each  case  would  have  to  be  taken  on  its 
merits:  but  I  do  feel  that  in  the  actual  measurement  the  attempt 
should  be  made  to  approximate  to  the  true  distribution,  by  comparing 
differences  carefully,  as  I  suggest  on  pp.  11 — 12. 

However,  I  wish  to  make  it  clear  that  there  is  not,  on  this  question 
of  the  use  of  ranks,  that  sharp  cleavage  of  opinion  between  Professor 
Spearman  and  myself  which  cannot  be  disguised  in  the  case  of  the 
General  Factor  argument. 

A  third  question  is  that  of  the  correction  of  raw  correlation  coef- 
ficients, in  which  I  share  to  a  considerable  extent  Dr  Brown's  misgivings 
as  to  Professor  Spearman's  formula,  and  on  which  Dr  Brown  has,  since 
the  first  edition  of  this  book,  added  experimental  evidence  as  to  that 
correlation  of  errors  and  true  values  which  the  Spearman  formula 
assumes  to  be  zero  (see  pp.  160 — 3). 

My  thanks  are  due  to  the  President  and  Council  of  the  Royal  Society, 
and  to  the  Editors  of  the  British  Journal  of  Psychology,  the  American 
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Journal  of  Psychology,  the  Psychological  Review,  and  Biometrika,  for 
their  kind  permission  to  reproduce  long  extracts  from  articles  in  the 
publications  controlled  by  them:  to  Professor  F.  M.  Urban  and  his 
publisher  Mr  W.  Engelmann  for  permission  to  print  Urban's  Tables  for 
the  Constant  Method:  to  Dr  E.  G.  Boring  of  Clark  University  for  valuable 
suggestions:  to  Dr  G.  J.  Rich  for  his  permission  to  print  his  checking 
table:  to  my  colleague  Dr  G.  R.  Goldsbrough  for  reading  some  mathe- 
matical pages  of  the  proofs :  and  to  Professor  Spearman  for  reading  the 
proofs  from  p.  65  onwards,  and  the  MS.  of  this  preface.  Acknowledg- 
ments to  authors  cited  are  made  by  footnotes,  but  should  any  omission 
in  this  or  any  other  respect  have  occurred,  my  sincere  apologies  are 
tendered  for  the  mistake. 

I  have  welcomed  the  opportunity  which  this  book  has  afforded  of 
printing  in  one  volume  arguments  which,  after  a  delay  due  to  four 
years'  military  service,  I  was  compelled  to  publish  piecemeal,  during 
1919  and  the  earlier  months  of  the  present  year,  in  various  scattered 
papers:  and  I  would  like  finally  to  thank  Dr  Brown  himself  for  per- 
mitting his  book  to  be  thus  mauled  by  another  and  will  close  as  I  began 
by  begging  that  none  of  my  crimes  may  be  visited  on  his  head. 


G.  H.  T. 


ARMSTRONG  COLLEGE, 

NEWCASTLE-UPON-TYNE. 
August,  1920. 
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PARTI 
PSYCHOPHYSICS 

CHAPTER  I 

MENTAL  MEASUREMENT 

Equal  appearing  intervals — Just  perceptible  distances — The  interpretation  of 
Weber's  Law — Indirect  methods  of  measurement — The  approach  to  measurement 
by  means  of  grading  magnitudes  and  their  differences. 

(1)  EQUAL  APPEARING  INTERVALS 

THE  pre-conditions*  of  measurement  in  any  sphere  of  experience  are 
(1)  the  homogeneity  of  the  phenomena,  or  of  any  particular  aspect  of  it, 
to  be  measured,  (2)  the  possibility  of  fixing  a  unit  in  terms  of  which  the 
measurement  may  be  made,  and  of  which  the  total  magnitude  may  be 
regarded  as  a  mere  multiple  or  sub-multiple.  These  pre-requisites  are 
satisfied  in  the  cases  of  spatial  and  temporal  magnitudes,  in  terms  of 
which,  directly  or  indirectly,  all  the  measurements  of  the  physical 
sciences  are  expressed.  It  was  thought  by  Fechner  that  they  are  also 
satisfied  in  the  case  of  the  strictly  psychical  phenomena  of  sensation- 
intensity,  i.e.  it  was  assumed  that  any  given  sensation-intensity  might 
be  regarded  as  made  up  of  a  sum  of  unit  sensation-intensities.  This 
view  has  been  definitely  rejected  by  later  psychologists.  Every  sensation- 
intensity  is  qualitatively  distinct  from  every  other  sensation-intensity. 
No  addition  or  subtraction  of  intensities  is  possible.  "  To  introspection, 
our  feeling  of  pink  is  surely  not  a  portion  of  our  feeling  of  scarlet;  nor 
does  the  light  of  an  electric  arc  seem  to  contain  that  of  a  tallow-candle 
in  itself"  (James)")*.  Fechner's  mistake  was  due  to  a  confusion  of 

*  These  pre-conditions  are  those  usually  stated.  But  the  idea  of  measurement  has 
been  so  expanded  during  recent  generations  by  the  mathematical  ideas  of  continuity, 
infinity  and  limit  that  they  are  becoming  inadequate  as  a  statement  of  the  position. 
Compare  the  last  section  of  the  present  chapter,  on  the  approach  to  measurement  by 
means  of  grading  magnitudes  and  their  differences. 

f  Principles  of  Psychology,  I.  p.  546. 
B.  &  T.  1 
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sensation-intensities  with  the  (physical)   stimulus-values  required  to 
produce  them. 

Nevertheless,  purely  psychical  measurement  is  not  entirely  im- 
possible. Within  any  one  series  of  sensation-intensities,  e.g.  a  series  of 
greys,  the  contrasts  or  "  distances  "  separating  different  pairs  of  intensities 
are  perfectly  homogeneous  with  one  another  and  can  be  measured  in 
terms  of  one  another  or  in  terms  of  an  arbitrarily  chosen  unit  of  "  sense- 
distance."  Given  two  brightness-intensities  a  and  6,  it  is  quite  possible 
to  find,  within  limits  of  error,  a  brightness-intensity  c  which  is  as  much 
higher  than  b  in  the  scale  of  intensities  as  b  is  than  a,  i.e.  such  that  the 
sense-distance  6c  =  the  sense-distance  ab\  or,  again,  it  is  quite  possible, 
theoretically,  to  find  a  brightness-intensity  d  which  bisects  the  sense- 
distance  a&,  i.e.  which  is  such  that  it  is  as  far  removed  from  a  in  the 
scale  of  intensities  as  b  is  from  it — in  symbols,  ad  =  db.  Hence  the 
"distance,"  or  disparity,  of  b  from  a  is  twice  that  of  d  from  a,  the  distance 
of  c  from  a  is  four  times  that  of  d  from  a.  If,  now,  ad,  or  the  distance 
of  d  from  a,  be  taken  as  a  conventional  unit,  the  values  of  ab  and  ac 
will  be  2  and  4  respectively*. 


Scale  of 
brightness-intensities 

Fig.  1 

A  scale  of  intensities  may  in  this  way  be  formed  rising  by  "equal- 
appearing  intervals"  or  sense-distances,  and  the  magnitude  of  any  given 
interval  may  then,  theoretically,  be  read  off  in  terms  of  the  unit-distance 
employed  in  the  construction  of  the  scale.  In  practice,  however,  it  is 
found  more  convenient  to  fix  the  successive  scale-marks,  the  successive 
members  of  the  intensity-series,  in  terms  of  their  corresponding  stimulus- 
values.  It  has  been  found  by  experiment  that  in  the  case  of  light- 
intensities  and  sound-intensities  the  successive  stimulus-values  form, 
with  fair  approximation,  a  geometrical  progression,  or,  in  other  words, 
each  stimulus-value  divided  by  the  immediately  preceding  one  gives 
approximately  the  same  quotient.  From  the  stimulus-values  corre- 
sponding to  an  ascending  series  of  eight  equidistant  brightness-values, 
Ebbinghaus  obtained  the  following  series  of  quotients: 

2-3     2-1     2-1     1-8     1-7     1-7     2-0 

*  This  view  of  mental  measurement  in  terms  of  sense-distance  first  originated  with 
J.  R.  L.  Delboeuf,  Revue  Philosophigue,  1878,  v.  p.  53.  His  term  for  sense-distance  was 
"contrasts  sensible. 
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The  quotient  value  is  not  entirely  constant,  being  slightly  greater 
towards  the  two  ends  of  the  scale  than  it  is  at  or  about  the  middle.  For 
this  central  region,  then,  the  general  relation  of  the  sense-distance  to 
the  stimulus-values  is  given  by  the  logarithmic  formula : 

,.  ,          -Y-      7  ,      stimulus  at  b 

bense-distance  ab  =  k  log  — -, —     — , 

&  stimulus  at  a 

where  the  stimulus  at  a  is  one  which  gives  any  finite  intensity  of  sensa- 
tion taken  as  the  starting  point  or  conventional  zero  (N.B.  it  is  not 
necessarily  liminal) ;  the  stimuli  at  a  and  b  are  those  which  correspond 
to  the  sensation-intensities  of  which  ab  is  the  "contraste  sensible"  or 
sense-difference. 

(2)  JUST  PERCEPTIBLE  DISTANCES 

A  mode  of  procedure  which  not  only  admits  of  much  wider  practical 
application  than  the  above-mentioned  "method  of  mean  gradations," 
but  also  possesses  a  peculiar  historical  importance,  is  that  which  is 
concerned  with  the  determination  of  the  stimulus-increments  corre- 
sponding to  just-noticeable  increments  of  sensation-intensity  in  different 
parts  of  the  intensity  scale.  Weber  found,  in  a  series  of  experiments 
chiefly  with  lifted  weights,  that  this  stimulus-increment  was  relatively, 
not  absolutely,  constant  for  different  regions  of  the  intensity  scale,  i.e. 
that  the  stimulus  corresponding  to  any  original  sensation-intensity  had 
always  to  be  increased  by  a  constant  proportion  to  arouse  a  just-noticeable 
increment  of  the  sensation-intensity.  If  a  103  grams  weight  is  just 
noticeably  heavier  than  a  100  grams  weight,  then  the  weight  just 
noticeably  heavier  than  a  200  grams  weight  will  be  a  206  grams  weight, 
not  203  grams. 

Mathematically  formulated,  Weber's  Law  is : 

8  (stimulus) 

— -—. -, —  =  a  constant. 

stimulus 

The  quantity  8  (stimulus)/stimulus — fa  or  -03  in  our  example  of  weight- 
lifting — is  known  as  the  "relative  difference  limen."  It  is  of  course  the 
average  value  of  a  considerable  number  of  determinations. 

Fechner  verified  Weber's  Law  in  many  different  realms  of  sensation- 
intensity,  and  made  it  the  basis  of  his  own  system  of  mental  measure- 
ment. This  he  did  by  making  the  following  three  assumptions: 

(1)  that  a  sensation-intensity  is  a  measurable  magnitude  and  may 
therefore  be  regarded  as  a  sum  of  unit-intensities ; 

(2)  that  just-noticeable  differences  of  sensation-intensity  are  equal 

1—2 
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at  different  parts  of  the  stimulus  scale,  and  may  therefore  conveniently 
serve  as  the  unit-intensities  above-mentioned; 

(3)  that  the  just-noticeable  difference  of  sensation  may  be  treated 
as  a  difference  of  two  sensations,  or  at  least  that  if  Weber's  Law  applies 
to  the  former  ("sensed  difference")  it  will  also  apply  to  the  latter 
("difference  sensation"). 

On  the  basis  of  Weber's  Law  and  these  added  assumptions,  Fechner 
obtains  the  following  formula,  viz. 

d  (stimulus) 

a  (sensation)  =  c  -—  ^  -  -,  --  -  , 
stimulus 

which  he  calls  the  fundamental  formula  for  mental  measurement.  Inte- 
grating, this  becomes 

sensation  =  c  loge  stimulus  -f  C. 

Putting  the  stimulus  in  this  equation  equal  to  the  stimulus  T  for  which 
the  sensation  is  just  below  the  threshold  of  consciousness,  i.e.  =  0,  we 

have       :  : 


Subtracting  the  second  from  the  first  equation, 

,.  T       stimulus 

sensation  =  c  loge  —  ~  —  . 

Putting  T  =  1,  and  transferring  to  the  ordinary  logarithm  system,  we  get 
sensation  =  k  log  stimulus. 

All  the  assumptions  involved  are  questionable.  The  first  one  has 
already  been  considered  at  some  length.  It  is  not  the  single  sensation- 
intensity  which  is  measurable,  but  the  distinctness,  disparity  or  distance 
of  one  sensation-intensity  from  another. 

We  must  therefore  regard  the  just-noticeable  difference,  not  as  a 
difference  of  two  sensation-intensities  but  as  a  minimal  sense-distance, 
if  we  are  to  be  able  to  make  use  of  it  in  our  scheme  of  mental  measure- 
ment. This  modification,  however,  still  leaves  us  involved  in  the  diffi- 
culties of  assumptions  (2)  and  (3). 

Fechner's  own  reason  for  regarding  all  just-noticeable  differences 
belonging  to  any  one  scale  of  intensities  as  equal  was  that  they  appear 
equal  to  introspection.  Introspection  in  a  case  like  this  is  obviously 
difficult,  even  for  the  most  skilful  observers,  and  its  verdict  cannot, 
therefore,  be  greatly  relied  upon.  Theoretically  it  is  quite  conceivable 
that  just-noticeable  differences,  though  equivalent  to  one  another  as 
being  all  just-noticeable,  i.e.  as  being  sense-  distances  so  small  that  the 
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slightest  diminution  of  them  would  cause  them  all,  equally,  to  cease  to 
be  noticeable,  yet  as  noticed  or  perceived  would  appear  of  different 
magnitude  one  from  another.  Ebbinghaus  points  to  the  analogous  case 
of  differentials  or  infinitesimals  in  mathematics.  These  are  all  equivalent 
to  one  another  as  being  all  equally  negligible  as  compared  with  finite 
magnitudes,  yet  are  by  no  means  necessarily  equal  to  one  another.  If 
they  belong  to  different  "orders,"  those  of  a  higher  order  are  negligible 
as  compared  with  those  of  a  lower,  etc.  Again,  "the  least  distances 
perceived  as  such  at  different  parts  of  the  skin  or  in  direct  and  indirect 
vision  do  not  by  any  means  all  appear  as  equal  magnitudes.  On  the 
contrary,  so  soon  as  they  come  to  consciousness  as  distances  they  are 
at  once  perceived  as  distances  of  varying  size,  in  a  certain  approximation 
to  their  objective  differences*."  In  spite  of  these  considerations, 
Ebbinghaus  regards  the  correspondence  of  the  stimulus  results  obtained 
for  equal  appearing  intervals  and  for  just  perceptible  intervals  in  the 
case  of  brightness-intensities  (in  the  middle  region  of  the  scale  both 
series  of  stimuli  form  a  geometrical  progression)  as  sufficient  evidence 
for  the  approximate  equality  of  the  latter  intervals.  Miiller  and  Wundt 
had  previously  advanced  the  same  argument. 

Several  experimental  investigations  have  been  made  with  the  express 
purpose  of  testing  the  relation  of  the  methods  of  minimal  change  and 
mean  gradations.  Titchenerf  sums  up  the  theoretical  basis  of  such 
experiments  concisely  as  follows:  "There  are  in  reality  two  possible 
ways  of  working.  (1)  We  might  take  a  series  of  stimulus- values,  corre- 
sponding to  a  series,  say,  of  eight  successive  just-noticeable  differences  of 
sensation,  and  thereafter  directly  compare  the  two  half-distances,  of  four 
just-noticeable  differences  each,  and  decide  upon  their  equality  or  in- 
equality. This  would  be  a  direct  method  of  experiment."  "  Or  (2)  we  might 
determine  a  few  just-noticeable  differences  of  sensation  at  different  parts 
of  the  stimulus  scale,  in  order  to  establish  the  constancy  of  the  relative 
difference  limen,  and  thereafter  work  with  supraliminal  differences,  and 
decide  whether  the  same  constancy  holds.  This  would  be  an  indirect 
method;  it  is  the  method  indicated  by  the  authors  just  cited  [Miiller, 
Wundt,  Kohler  and  Tannery].... Either  of  these  two  methods  would, 
presumably,  take  us  to  our  goal.  The  experimental  work  would  be 
exceedingly  difficult.  Liminal  determinations  are  always  and  intrinsi- 
cally difficult;  and,  further,  the  judgments  passed  upon  just-noticeable 
differences  and  upon  supraliminal  differences  are,  even  under  the  most 

*  Ebbinghaus,  Grundziige  der  Psychologic,  2nd  ed.,  1905,  p.  524. 
f  "Experimental  Psychology,"  u.  Instructor's  Manual,  p.  Ixxvii. 
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favourable  conditions,  the  expressions  of  radically  different  mental 
attitudes." 

It  should  be  mentioned  here  that  the  principal  rival  to  Fechner's 
hypothesis — the  "difference  hypothesis" — is  that  first  formulated  by 
Plateau,  and  generally  known  as  the  "quotient  hypothesis."  Plateau 
adopted  the  psychophysical  formula 

sensation  =  c  (stimulus)* 

on  the  basis  of  experiments  by  the  method  of  mean  gradations.    This 
implies,  in  the  place  of  Fechner's  fundamental  formula,  the  formula 
8  (sensation)  _  ,  8  (stimulus) 

sensation  stimulus 

in  other  words,  it  assumes  that  just-noticeable  differences  are  relatively, 
not  absolutely,  equal  sensation-magnitudes.  Although  Plateau  himself 
withdrew  his  formula  later,  the  "quotient  hypothesis"  still  remains  as 
the  rival  of  Fechner's  "difference  hypothesis." 

To  return  to  the  experiments.  Merkel  (1888)  worked  with  bright- 
nesses, pressures,  and  noises,  and  found  that  the  stimulus  corresponding 
to  the  sensation  bisecting  a  supraliminal  sense-distance  was  the  arith- 
metical mean  of  the  stimuli  corresponding  to  the  two  terminal  sensations, 
which  would  seem  to  support  the  quotient  hypothesis,  though  such  an 
inference  is  not  entirely  free  from  objection.  Angell  (1892)  worked  with 
noise-intensities  by  the  method  of  mean  gradations,  avoided  certain 
sources  of  error  present  in  Merkel's  form  of  procedure,  and  obtained 
results  supporting  the  difference  hypothesis,  the  stimulus  of  the  bisecting 
sensation  being  the  geometrical,  not  the  arithmetical,  mean  of  the 
terminal  stimuli. 

A  more  thorough  investigation  was  carried  out  by  W.  Ament*  in 
1900.  He  used  a  series  of  Marbe  greys  for  the  brightnesses  and  employed 
the  direct  method;  for  the  noise- intensities  he  used  a  Fechner  sound 
pendulum  and  worked  principally  by  the  indirect  method.  The  result 
reached  supported  the  quotient  hypothesis.  But  Ament' s  work  has  not 
escaped  criticism.  A  repetition  of  his  experiments  on  brightness- 
intensities  by  Frobesf  has  failed  to  confirm  his  results.  Ebbinghaus 
also  has  found,  in  careful  experiments  with  rotating  sectors,  that  just- 
noticeable  differences  in  different  parts  of  the  scale  of  brightnesses  are 
equal  to  one  another.  On  the  whole,  therefore,  the  balance  of  evidence 
seems  to  be  in  favour  of  the  "difference  hypothesis." 

*  W.  Ament,  "Ueber  das  Verhaltnis  der  ebenmerklichen  zu  den  uebermerklichen 
Unterschieden  bei  Licht-  und  Schall-intensitaten,"  Phil  Studien,  1900,  xvi.  pp.  135  ff. 
f  Zeitechrift  fur  Psychologic,  xxxvi.  p.  344. 
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(3)  THE  INTERPRETATION  OF  WEBER'S  LAW 

Fechner's  third  assumption — see  above,   p.   4 — brings  us  to  the 
question  of  the  interpretation  of  Weber's  Law. 
There  are  three  general  forms  of  interpretation : 

(1)  the  psychophysical  (Fechner), 

(2)  the  physiological  (Miiller,  Ebbinghaus,  James,  etc.), 

(3)  the  psychological  (Wundt). 

According  to  (1),  the  logarithmic  transition  takes  place  in  passing 
from  the  physiological  changes  in  the  sensory  centres  of  the  cerebral 
cortex  to  the  corresponding  sensation-intensities.  The  chief  objection 
to  this  view  of  Fechner's  is  that  Weber's  Law  is  not  exact.  This  same 
consideration  supports  (2),  or  the  physiological  view,  according  to  which 
the  transition  occurs  either  at  the  inception  of  the  stimulus  in  the  sense 
organ,  or  somewhere  between  the  nerve  endings  and  their  central  con- 
nections in  the  sensory  areas  of  the  cortex.  Experimental  results 
obtained  by  Waller*  and  Steinachf  are  in  favour  of  this  view.  Waller 
stimulated  a  frog's  eye  with  light  of  different  intensities,  and  found  that 
the  corresponding  "negative  variations"  set  up  in  the  optic  nerve  varied 
in  intensity  as  the  logarithm  of  the  stimulus-values  (approx.).  Steinach 
obtained  similar  results  on  stimulating  the  skin  of  a  frog's  thigh  with 
weights  and  noting  the  negative  variation  in  the  attached  nerve.  If  the 
negative  variation  may  be  assumed  to  be  proportional  to  the  intensity 
of  the  nerve-current  passing  along  the  nerve,  these  results  point  to  the 
conclusion  that  the  logarithmic  transition  occurs  in  the  sense  organ  and 
its  sensory  nerve  endings. 

EbbinghausJ  has  constructed  a  theory  based  upon  the  conception 
of  varying  degrees  of  dissociability  of  complex  molecules  to  account  for 
the  law  and  also  for  the  deviations  from  it  towards  the  two  extremes  of 
the  intensity  scale. 

The  psychological  view,  (3),  of  the  law,  held  by  Wundt,  regards  it 
as  a  special  case  of  the  general  psychological  "law  of  relativity." 
Stimulus,  physiological  process,  and  pure  sensation-intensity  increase  in 
simple  proportion  to  one  another.  The  logarithmic  transition  occurs  in 
passing  from  mere  sensation  and  sensation  difference  to  apperceived 
sensation  and  apperceived  sensation  difference,  and  the  intensities  are 

*  Waller,  "Points  relating  to  the  Weber  Fechner  Law,"  Brain,  1895,  xvm.  p.  200. 

f  Steinach,  "  Elektromotorische  Erscheinungen  an  Hautsinnesnerven  bei  adaquater 
Reizung,"  Pflugers  Arch,  1896,  Bd.  63,  S.  495. 

%  Ebbinghaus,  "  Ueber  den  Grund  der  Abweichungen  von  dem  Weber' schen  Gesetz 
bei  Lichtempfindungen,"  PflugeSs  Arch.  1889,  Bd.  64,  S.  113. 
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apperceived  always  in  relation  to  one  another.  In  addition  to  the 
objection  that  it  regards  the  sensation-intensities  themselves,  not  their 
distances  from  one  another,  as  measurable  magnitudes,  this  view  is  also 
open  to  the  criticism  that  it  furnishes  no  explanation  of  the  widely 
varying  size  of  the  relative  difference  limen  in  different  sense-depart- 
ments (e.g.  DL  for  brightness-intensities  =  y^j,  DL  for  sound-intensities 
=  about  J) ;  the  physiological  view  sees  in  the  varying  structure  of  the 
different  sense  organs  the  adequate  explanation  of  this.  Moreover,  the 
psychological  view  has  no  completely  satisfactory  explanation  to  give 
of  the  deviations  from  Weber's  Law  so  frequently  met  with. 

These  objections  and  difficulties  make  the  view  improbable,  but  by 
no  means  prove  it  to  be  impossible.  It  has  the  great  merit  of  emphasising 
more  definitely  than  was  heretofore  the  case  the  importance  of  the  more 
purely  psychological  factors  in  psychophysical  experiments — in  par- 
ticular, it  brings  into  prominence  the  distinction  between  mere  disparity 
of  sensation-intensities  present  simultaneously  or  in  immediate  succession 
in  the  same  consciousness,  and  the  perception  of  this  disparity,  the 
discrimination  of  the  intensities  one  from  another.  In  psycho-physical 
experiments  the  subject's  consciousness  is  not  limited  to  the  mere 
sensational  level. 

In  this  connection  the  distinction,  explained  in  Chapter  III  of  the 
present  book,  on  page  75,  between  two  essentially  different  measures 
of  a  subject's  fineness  of  discrimination,  is  not  without  importance,  for 
those  two  measures,  as  far  as  experimental  work  goes,  are  both  believed 
to  obey  Weber's  Law.  The  one  is  the  difference  threshold  spoken  of  in 
the  present  discussion,  the  other  is  there  defined  and  named  the  inter- 
quartile range  of  the  point  of  subjective  equality:  and  the  two  quantities 
appear  to  differ  in  the  level,  sensational  or  perceptual,  at  which  they 
stand.  The  extended  idea  of  the  probability  of  a  judgment  there  defined 
throws  this  into  prominence. 

Another  fact  which  would  seem  to  have  a  very  immediate  bearing 
on  the  question  of  the  interpretation  of  Weber's  Law  is  that  plants  in 
their  response  to  the  stimulus  of  gravity  (Geotropism)  appear  to  obey 
that  law*. 

Although  with  continuous  increase  of  stimulus-intensities  the  corre- 
sponding sensation-intensities  rise  in  steps,  each  representing  a  just- 
noticeable  difference,  the  psychophysical  relation  is  really  a  strictly 

*  H.  Fitting,  Jahrbuch  f.  wiss.  Botanik,  Bd.  XLI.  1905;  F.  Darwin,  New  Phytologist, 
1906;  James  Small,  Annals  of  Botany,  xxxi.  April  1917;  James  Small,  Proc.  Roy.  Soc.  B, 
xc.  1918. 
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continuous  one,  as  becomes  at  once  obvious  if  we  consider  a  special  case. 
The  sensation-intensity  aroused  in  lifting  a  weight  of  100  grams  is 
"indistinguishable,"  as  we  say,  from  that  aroused  by  102  grams,  the 
sensation  aroused  by  102  grams  is  "indistinguishable"  from  that  aroused 
by  104  grams;  yet  the  sensation  aroused  by  100  grams  is  perceptibly 
different  from  that  aroused  by  104  grams.  Thus  the  sensation-intensity 
increases  continuously,  and  the  reason  why  this  is  not  immediately 
apparent  is  probably  to  be  looked  for  in  the  physiological  mechanism 
of  the  psychophysical  organism.  The  fact  is  that  statements  like  the 
above  as  to  two  sensations  being  "distinguishable"  or  not  lack  precision 
in  the  absence  of  a  definition  of  what  is  to  be  meant  by  distinguishable. 
To  introspection  almost  all  sensations  are  distinguishable  from  one 
another  inasmuch  as  we  seldom  will  agree  that  two  are  identical.  More- 
over, although  a  man  will  not  (in  the  case  of  a  subject  of  average  sensi- 
tivity) give  a  majority  of  answers  heavier  in  comparing  102  grams  with 

100  grams,  yet  the  number  he  does  give  will,  if  the  experiment  is 
sufficiently  carefully  performed,   be  greater  than  he  will  give  with 

101  grams  as  the  comparison  weight  (100  still  being  the  standard). 
Although  therefore  he  does  not  give,  either  with  101  or  with  102  grams, 
a  majority  of  answers  heavier  in  comparing  them  with  a  standard  of 
100  grams,  yet  he  does  distinguish  them  from  one  another  (if  we  take 
the  result  of  a  number  of  experiments),  giving  more  heavier  answers 
with  the  heavier  weight*. 

Delboeuf  held  that  the  limen  has  no  psychological  importance 
whatever.  If  this  is  an  extreme  view,  the  importance  which  Fechner 
attributed  to  the  limen  is  equally  extreme  in  the  other  direction. 

The  absolute  or  stimulus  limen  is  similar  in  kind  to  the  difference 
limen,  since  consciousness  is  never  empty  of  sensation-intensities  when 
such  a  limen  is  being  determined.  Here  again,  Fechner's  rigid  distinction 
of  the  two  was  a  fundamental  error. 

(4)  INDIRECT  METHODS  OF  MEASUREMENT 

The  preceding  account  has  probably  sufficed  to  show  that  purely 
psychical  measurement  is  a  conceivable  possibility.  Its  practical  appli- 
cation however  has  been  more  detailed  than  extensive.  A  more  generally 
useful  method  in  quantitative  psychology  is  that  which  measures  the 
external,  physical  or  physiological,  causes  and  effects  of  mental  process. 
The  measurements  are  made  in  terms  of  the  physical  units  of  space  and 

*  Compare  Poincare,  Science  and  Hypothecs,  Scott,  1905,  p.  22;  and  G.  H.  Thomson, 
British  Association  Reports,  1913,  paper  under  sub-section  I. 
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time,  yet  they  are  not  merely  physical  measurements,  since  they  derive 
all  their  significance  from  the  correlated  psychical  processes.  They  are 
indirect  psychical  measurements*.  Measurements  of  reaction-times, 
memory,  fatigue,  illusions,  etc.  are  all  of  this  nature.  Their  varieties 
are  innumerable,  and  are  illustrated  by  the  accounts  in  any  good  text- 
book of  experimental  psychology  (Sanford,  Titchener,  Myers).  In  all 
cases  full  introspective  accounts  are  essential,  and  when  correlated  with 
the  measurements  make  the  latter  essentially  psychical  measurements. 
Measurements  of  limina,  referred  to  in  the  previous  section,  are  of  the 
same  nature.  They  are  of  some  special  importance  as  being  measures  of 
sensory  acuity,  etc. — aspects  of  the  total  mental  ability  of  psycho- 
physical  organisms.  They  figure  prominently  in  many  researches  based 
on  the  use  of  "mental  tests." 

A  method  which  makes  a  partial  return  to  the  more  purely  psychical 
form  of  measurement  in  terms  of  "distance"  is  the  method  of  ranks  or 
grades.  Suppose  we  are  considering  the  relative  abilities  of,  say,  100  boys 
in  English  Composition.  We  should  find  it  difficult  to  mark  their  essays 
individually  in  terms  of  any  constant  unit  but  might  find  it  possible  to 
arrange  them  in  order  of  merit,  especially  if  we  had  sufficient  time  at 
our  disposal  to  employ  the  method  of  "paired  comparisons."  According 
to  the  procedure  of  this  latter  method,  the  essays  would  be  taken  in 
pairs,  quite  at  random,  and  the  better  essay  of  each  pair  would  be  given 
a  "  preference  mark."  This  procedure  would  be  repeated  again  and  again 
until  every  essay  had  been  compared  with  every  other  essay.  The  order 
of  merit  is  then  given  by  the  number  of  preferences  attaching  to  each 
essay.  In  this  order,  however,  we  cannot  assume  that  the  "ability- 
distance"  from  one  boy  to  the  next  is  a  constant  quantity.  The  boys 
near  the  extreme  ends  of  the  series  will  be  farther  removed  from  one 
another  than  the  boys  near  the  middle.  We  could  only  adjust  for  this 
if  we  knew  the  law  of  frequency- distribution  for  this  kind  of  ability  in 
this  particular  species  of  boy,  and  theoretically  the  determination  of 
this  distribution  depends  upon  a  prior  fixing  of  the  psychological  unit, 
the  unit  of  "ability-distance";  so  that  strictly  the  problem  is  insoluble. 
Since  however  under  certain  definite  and  indefinite  conditions  the  form 
of  distribution  in  a  large  number  of  biological  and  other  cases  of 
"physical"  measurement  has  been  found  to  be  either  Gaussian  (normal) 
or  differing  from  normal  in  ways  described  by  Pearson's  family  of 
frequency  curves,  we  might,  with  some  probability  of  being  near  the 
truth,  assume  the  normal  form  of  distribution  in  the  given  case  and  so 
*  See  Ebbinghaus,  Orundziige  der  Psychologic,  1905,  pp.  75,  76. 
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obtain  a  quantitative  measure  for  the  ability  of  each  particular  boy*. 
A  direct  psychological  determination  and  application  of  the  (conven- 
tional) unit-distance  is  not,  perhaps,  an  entirely  impossible  problem, 
and  work  in  this  direction  may  be  expected  and  if  achieved  would 
certainly  be  much  more  scientific  and  psychological  than  the  present 
method  of  measuring  in  terms  of  the  external  quantum  of  work  done. 

Finally,  the  interrelations  of  different  mental  abilities  within  any 
well-defined  group  of  individuals  situated  within  any  definite  environ- 
ment may  be  determined  by  means  of  the  technical  method  of  "correla- 
tion." A  correlation  coefficient  or  other  similar  constant  (e.g.  correlation 
ratio)  measures  the  tendency  towards  concomitant  variation  of  two 
mental  or  other  abilities  within  a  group  of  individuals.  The  result  may 
be  transferred  to  any  single  individual  within  the  group  as  measuring  the 
degree  of  probability  of  connection  of  the  two  abilities  in  the  particular 
case.  The  correlation  between  two  abilities  may  be  due  to  an  actual  direct 
relation  of  the  abilities  to  one  another,  or,  indirectly,  to  the  influence  of  a 
common  external  environment  upon  them  both.  The  first  of  these  two 
cases  is  perhaps  the  more  important,  but  the  possibility  of  the  second 
should  not  be  lost  sight  of,  and  it  also  has  a  special  interest  of  its  own. 
The  problems  of  correlation  will  be  considered  more  fully  in  a  later  chapter. 

(5)  THE  APPROACH  TO  MEASUREMENT  BY  MEANS  OF  GRADING 
MAGNITUDES  AND  THEIR  DIFFERENCES 

Since  the  publication  of  the  first  edition  of  this  book  an  important 
symposium  bearing  on  the  question  of  mental  measurement  has  been 
held  (in  1913).  The  exact  problem  submitted  to  the  joint  meeting  of 
the  Mind  Association,  the  Aristotelian  Society,  and  the  British  Psycho- 
logical Society,  was  "Are  the  Intensity-differences  of  Sensation  Quanti- 
tative?" Although  many  of  the  arguments  of  that  discussionf  are 
beyond  the  province  of  this  book,  it  is  of  interest  to  note  that  there  was 
a  general  consensus  of  opinion  that  sensation-intensities,  and  their 
differences,  are  at  least  "magnitudes"  which  can  be  graded,  even  if 
they  be  not  "quantities"  which  can  be  measured.  And  following  out 
further  a  suggestion  contained  in  a  quotation  from  Mr  Bertrand  Russell 
made  by  Professor  Dawes  Hicks  in  his  contribution  to  the  symposium, 
it  may  here  be  pointed  out  that  grading  leads,  if  the  differences  can 
also  be  graded,  to  something  almost  indistinguishable  from,  if  indeed  it 

*  This  was  done,  e.g.  by  Professor  Pearson  in  his  paper  in  Biometriktt,  1907,  v.  p.  105, 
and  the  example  has  been  followed  with  success  by  other  workers. 

t  The  papers  are  published  in  the  British  Journ.  of  Psychol  1913,  vi.  pp.  137—189. 
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be  not  identical  with,  true  measurement.  For  if  we  can  arrange  in  order  of 
magnitude  a,b,c,  ...  and  also  their  first  differences  a  —  6,  b  —  c,  c  —  d,  ... 
and  the  differences  of  these  differences  in  turn,  and  so  on,  we  can  space 
out  the  original  quantities  a,  6,  c,  ...  as  accurately  as  though  we  used 
a  unit  and  measured  them.  To  take  a  simple  example,  suppose  five 
quantities  a,  6,  c.  d,  e  have  really  the  measures  10,  16,  20,  31,  32.  If  an 
observer,  ignorant  of  these  measures,  only  knows  the  order  of  grading 
a,  b,  c,  d,  e  of  the  quantities,  he  has  already  made  a  considerable  advance 
even  although  he  does  not  know  the  spacing,  or  the  distances  apart. 
If  however  he  further  can  grade  the  differences,  that  is,  if  he  knows  that 
the  greatest  difference  is  that  between  'd  and  c,  and  that  the  others 
follow  in  the  order  6  —  a,  c  —  6,  e  —  d,  he  has  advanced  further  towards 
measurement,  in  the  sense  of  accurate  spacing,  although  there  are  still 
many  spacings  that  will  satisfy  these  gradings.  Thus  far  he  can  almost 
always  go  in  mental  phenomena.  And  although  it  is  practically  difficult, 
there  does  not  seem  any  theoretical  difficulty  about  taking  the  next 
step,  and  grading  the  differences  of  the  second  order.  In  our  example 
this  grading  is  ^  ~  c)  -  (b  -  a)  or  a,  say, 

(c-b)~  (e  -  d)   or  j8, 
(b  —  a)  —  (c  —  b)   or  y: 
and  the  order  of  the  third  differences  is 

(a-jS),       (jB-y). 

If  now  we  could  have  all  these  gradings  we  could  space  out  the  original 
quantities  very  closely  indeed  to  their  true  positions.  This  can  be  best 
seen  by  attempting  to  alter  some  one  of  the  values  while  leaving  all 
these  gradings  unaltered.  Make  d,  for  example,  29  instead  of  31  and 
although  the  order  a,  6,  c,  d,  e  is  unchanged,  and  also  the  order  of  the 
first  differences,  that  of  the  second  differences  is  completely  altered. 

With  an  infinite  number  of  quantities,  and  all  the  gradings  of  all 
their  differences,  we  should,  it  would  seem,  arrive  at  an  exact  solution 
of  the  problem,  so  that  grading  and  measurement  are  not  perhaps  so 
different  in  their  nature  as  might  at  first  be  thought. 

Indeed  a  case  could  well  be  made  out  for  the  thesis  that  the 
theoretical  objections  sometimes  brought  against  mental  measurement 
really  hold  in  the  last  resort  against  all  measurement,  and  prove  too 
much:  and  that  the  real  difference  between  mental  measurement  and 
physical  measurement  is  simply  that  mental  phenomena,  being  practi- 
cally more  difficult  to  handle,  force  on  our  notice  the  epistemological 
difficulties  inherent  in  all  measurement,  whereas  in  physical  measurement 
familiarity  has  bred  contempt. 


CHAPTER  II 

THE  ELEMENTARY  THEORY  OF  PROBABILITY 

Some  statistical  terms — Arithmetical  short-cuts — Measures  of  scatter — The  funda- 
mental theorem  in  probability — The  binomial  expansion — The  normal  curve  of  error 
— Fitting  a  normal  curve  to  distribution  data — The  method  of  least  squares. 

(1)  SOME  STATISTICAL  TERMS 

THE  theory  of  probability  was  developed  chiefly  from  two  classes  of 
material,  (a)  games  of  chance  such  as  coin  or  dice  throwing,  roulette, 
etc.,  and  (6)  statistics,  as  they  are  called,  that  is  such  collections  of 
quantitative  information  as  the  census,  trade  returns,  insurance  data  and 
the  like.  It  is  easy  to  see  that  psychological  experiments  frequently 
resemble  both  of  these  classes.  Any  experiment,  the  result  of  which 
depends  upon  a  human  decision,  has  much  in  common  with  the  throw 
of  a  die.  In  both  cases  it  often  seems  mere  chance  what  the  result  is, 
although  we  believe  that  in  both  cases  this  is  due  only  to  our  ignorance 
of  the  numerous  factors  at  work.  And,  since  this  is  so,  any  scientific 
experiment  on  the  actions  and  reactions  of  human  beings  must  neces- 
sarily be  repeated  many  times,  until  there  accumulates  a  mass  of 
quantitative  information  similar  in  many  respects  to  a  census  return. 

The  mass  of  quantitative  information  thus  accumulated  is  found 
upon  examination  to  have  certain  peculiarities  or  properties.  For 
example  consider  the  following  case: — The  experiments,  carried  out  by 
Professor  Urban  in  1906—7,  were  on  lifted  weights.  A  standard  weight 
of  100  grams  was  compared,  by  lifting  it,  with  weights  of  84,  88,  92, 
96,  100,  104  and  108  grams.  The  standard  weight  was  always  lifted 
first,  and  as  the  second  unknown  weight  was  lifted  the  judgment  lighter, 
equal  or  heavier,  was  given*. 

Suppose  the  following  answers  were  obtained  on  one  occasion : 

108  grams,  answer  heavier 

104  „  „  equal 

100  „  „  heavier 

96  „  „  lighter 

92  „  „  equal 

88  „  „  lighter 

84      „  „  lighter 

*  See  "Die  psychophysischen  Massmethoden  als  Grundlagen  empirischer  Messungen," 
by  F.  M.  Urban,  Archivfur  die  gesamte  Psyckologie,  1909,  xv.  p.  261. 
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In  this  series  the  lowest  answer  heavier  is  at  100  grams.  Let  this  experi- 
ment be  repeated  400  times,  and  in  every  series  let  the  position  of  the 
lowest  answer  heavier  be  recorded.  In  a  particular  case  the  distribution 
of  these  just  perceptibly  heavier  points  was  as  follows: 


Grams 
Frequency  ... 


84 
1 


88 
8 


92 
36 


96 

85 


100 
143 


104 
119 


108 

8 


These  particulars  are  shown  in  graphic  form  in  the  adjoining  figure 
where  the  points  have  been  joined  by  straight  lines  to  make  a  polygon, 
which  shows  at  once  the  chief  peculiarities  of  such  a  collection  of  data, 


150 


100 


I 


50 


84          88          92          96         100        104        108  grams 
Fig.  2.  A  cocked  hat  curve  (Urban' 8  Subject  III) 

namely  that  the  points  are  not  scattered  anyhow,  but  occur  most  fre- 
quently at  a  central  value  from  which  the  frequency  falls  off  in  both 
directions.  Such  a  figure  as  this  is  colloquially  termed  a  cocked  hat.  The 
point  where  the  summit  occurs,  or  point  of  greatest  frequency,  is  called 
the  mode.  Here  it  is  apparently  at  100  grams,  but  if  other  weights 
between  100  and  104  grams  could  have  been  examined,  it  might  be  found 
to  be  elsewhere.  The  figure  looks  as  though  a  smooth  curve  through  the 
points  would  bring  the  mode  somewhere  a  little  higher  than  100  grams. 
The  middlemost  of  the  400  positions  of  the  just  perceptibly  heavier 
point  is  called  the  median.  It  too  is  in  the  100  gram  group,  though  here 
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again  its  exact  position  is  uncertain.  A  better  known  central  value  (and 
one  which  can  be  more  exactly  calculated)  is  the  mean  or  average  value, 
found  thus: 

1  x    84=       84 

8x    88=     704 

36  x   92=  3312 

85  x   96=  8160 

143x100  =  14300 

119x104  =  12376 

8x108=     864 

Divide  by  400  j  39800 


99-5  grams 

In  the  above  cases  the  only  possible  values  of  the  sought  for  points 
were,  by  the  nature  of  the  experiments,  at  certain  definite  values.  In 
another  class  of  experiment,  however,  all  values  within  the  range  are 
possible  as  in  the  following  case.  Twenty- nine  experiments  were  made 
of  bisecting  a  line  by  eye*.  The  lengths  of  the  left-hand  half  of  the  line 
on  these  29  occasions  are  given  in  this  table,  arranged  in  order  of 
magnitude : 

63-8  mms. 


62-1 
61-4 
61-3 
61-2 
61-2  upper  Quartile 

6M 
61-0 
60-9 
60-8 
60-6 
60-4 
60-3 

60-0  Median 


59-9 
59-9 
59-6 
59-5 


59-2 

59-1  lower  Quartile 


59-0 
68-8 
58-7 
58-6 
58-2 
58-1 
57-6 

29)  1743-7 

60-13  Mean. 

*  An^experimentjperformed  for  the  purpose  of  illustrating  this  chapter.    Subject, 
G.  H.  Thomson. 
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Here  the  median  by  counting  is  60  mms.  and  the  mean  on  calculation  is 
found  to  be  60-13  mms.  In  this  case  the  "cocked  hat"  is  not  at  first 
sight  evident,  but  if  the  data  are  grouped  in  some  way  it  comes  to  light. 
For  example,  they  can  be  arranged  thus: 

One  reading  from  63  to  63-9  mms.  inclusive 


J.  VW\J   XC?<*U~LU.fclO 

Six 

UUH 

61 

61-9    „ 

Six        „ 

60 

60-9    „ 

Eight    „ 

59 

59-9    „ 

Five      „ 

58 

58-9    „ 

One  reading 

57 

57-9    „ 

Here  the  numbers  1,  2,  6,  6,  8,  5,  1  show  clearly  the  concentration  in 
the  middle,  although  being  smaller  they  are  not  so  regular  as  in  the 
previous  "cocked  hat."  Instead  of  concentrating  these  at  points  it 
seems  more  accurate  to  construct  a  diagram  such  as  that  here  given 


57          58          59          60          61 

Millimetres 


62          63          64 
Fig.  3.   A  histogram  formed  from  the  bisection  data 


where  a  rectangle  of  the  proper  height  is  constructed  over  the  corre- 
sponding base.  Such  a  figure  is  called  a  "histogram." 

This  example  is  convenient  for  explaining  two  new  terms,  namely 
the  "quartiles"  and  "semi-interquartile  range."  The  "quartiles"  are 
the  readings  which  are  one-quarter  distant  from  each  end,  just  as  the 
"median"  is  half-way.  In  the  table  on  page  15  there  are  29  readings. 

29  4-  1 
The  half-way  reading  is  -       -  or  15  from  either  end,  and  the  quartiles 


are  at  the  readings 


29  +  1 


or  7J  from  each  end,  for  which  we  can  take 


the  mean  of  the  seventh  and  eighth  readings  at  59-05  for  the  lower 
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quartile  and  61-15  for  the  upper  quartile.   The  semi-interquartile  range 
is  half  the  distance  between  the  quartiles.  It  is,  in  this  case, 
:',,, ..., ,6M6-59.Q6  =  ^ 

Clearly  the  semi-interquartile  range  is  a  crude  measure  of  the  scatter 
of  the  readings.  When  it  is  large  the  readings  are  more  scattered,  and 
therefore,  any  one  of  them  is  less  likely  to  be  correct,  than  if  the  semi- 
interquartile  range  had  been  small. 

The  mean  value  is  sometimes  called  the  expectation.  This  term  arises 
from  games  of  chance.  For  example,  suppose  I  am  playing  against  an 
opponent  at  a  game  of  dice,  in  which  my  opponent  has  to  give  me  as 
many  shillings  as  there  are  pips  in  a  single  throw  of  the  die,  then  the  sum 
which  I  ought  to  give  him  before  each  throw,  in  order  to  make  the  game 
an  even  one,  is  three  shillings  and  sixpence,  which  is  my  expectation  of 
gain.  It  is  not  the  most  probable  sum  for  my  opponent  to  give  me ;  indeed 
he  will  never  give  me  this  exact  sum,  since  he  always  pays  me  in  shillings 
and  not  in  pence.  But  at  the  end  of  a  sufficiently  large  number  of  throws 
I  shall  have  received  approximately  this  sum  per  throw  on  the  average. 

From  another  point  of  view  the  mean  will  be  found  to  correspond 
to    a    centre    of    gravity, 
namely  the  centroid  of  the 
curve  of  distribution  of  the   jj 
readings.  Let  the  adjoining 
figure  represent  such  a  curve, 

the    number    of    readings  -p.    4 

which  occur  within  a  portion 

AB  of  the  range  being  represented  by  the  area  included  between  the 
ordinates  at  A  and  B.  Then  the  whole  area  of  the  curve  will  represent 
2V  the  total  number  of  readings.  That  is 


where  the  integration  is  over  the  whole  curve.   Each  reading  is  repre- 
sented by  its  position  on  the  ic-axis,  and  the  sum  of  all  the  readings  is 

xydx.    . 
The  mean  is  therefore  given  by 

\xydx          1 

or   jq  \xydx. 
ydx 


\ 
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But  this  is  the  expression  for  the  abscissa  of  the  centre  of  gravity  of 
the  curve,  through  which  the  ordinate  above  the  mean  therefore  passes. 

(2)  SOME  ARITHMETICAL  SHORT-CUTS 

There  are  several  devices  which  considerably  shorten  the  arithmetical 
labour  of  finding  the  mean.  These  can  be  illustrated  by  the  example 
on  page  15,  near  the  top. 

Choosing  a  convenient  origin.  The  occurrence  of  large  numbers  in  the 
calculations  is  minimised  if  an  origin  is  chosen  within  the  actual  range 
of  distribution,  preferably  either  at  one  end  or  in  the  middle.  In  the 
example  we  have  in  mind,  the  point  100  grams  might  be  selected,  as  the 
large  number  143  is  thus  avoided.  The  calculations  would  then  appear 
as  follows: 


lx 

8x 
36  x 

85  x 

119  x 

8x 

-16 
-12 

-   8 
-   4 

4 

8 

-16 

-96 
-288 
-340 

-740 

476 
64 

540 
-740 

400)  -200 

-0-5 
origin  100 

+  99-5  Mean 

Choosing  a  convenient  unit.  Since  in  this  particular  case  the  measure- 
ments are  all  made  at  intervals  of  4  grams,  it  is  a  considerable  simplifica- 
tion if  this  is  chosen  as  the  unit.  The  same  calculation  then  appears  as 
follows : 

lx-4  -4 

8x-3  -24 

36  x  -2  -72 

85  x  - 1  -85 

-185 

119  x      1  119 

8x     2  16 

135 

-185 

400)  -  50 

-  -125 

4  grama  working  unit 

.-  -5 
origin    100 

+99-5  Mean 
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The  Summation  Method  of  finding  the  Mean 

This  is  only  applicable  in  cases  like  the  present  where  the  readings 
are  concentrated  at  a  number  of  equidistant  points.  The  calculation  then 
appears  as  follows : 


84  grama 

1 

400 

88  „ 

8 

399 

92  „ 

36 

391 

96  „ 

85 

355 

100  „ 

143 

270 

104  „ 

119 

127 

108  „ 

8 

8 

400~ 

)1950 

4-875 

4  grams  per  unit 


19-5 
origin  80 

99^5  Mean 

The  figures  in  the  second  column  are  the  continued  sum,  from  below 
upwards,  of  the  figures  in  the  first  column.  They  are  then  themselves 
added  up,  the  total  being  1950.  Now  it  is  clear  from  its  method  of 
formation  that  this  total  is  composed  as  follows: 

7x8  +  6x  119 +  5x  143 +  4x  85 +  3x  36 +  2x8  +  1x1. 
That  is  to  say,  it  corresponds  to  taking  the  origin  at  80  grams,  and  a 
unit  of  4  grams. 

It  is  instructive  to  try  this  calculation  from  the  other  end,  as  it  were, 
making  the  continued  sum  from  above  downwards.  The  reader  should 
try  other  modifications  of  the  idea  underlying  this  summation  method. 
For  example,  how  should  it  be  carried  out  in  order  to  place  the  origin 
at  84  grams? 

(3)  MEASURES  OF  SCATTER 

Let  us  consider  again  the  experiment  of  bisecting  a  line,  described 
on  p.  15.  Since  the  lines  bisected  measured  126  mms.,  the  true  half 
lay  at  63  mms.  The  mean  of  the  trials  was  60-13  mms.  so  that  an  error 
of  2-87  mms.  was  made.  This  error  does  not  in  itself  however  give  a 
complete  description  of  the  subject's  performance;  a  measure  of  scatter 
is  required.  One  such  is  the  semi-interquartile  range.  When  we  say  that 
the  semi-interquartile  range  is  1-05  mms.  we  mean  that  half  the  attempts 
at  bisection  lay  within  a  range  of  2-1  mms.,  and,  since  half  the  trials 
already  made  come  within  this  range,  the  probability  of  the  next  trial 
also  coming  within  it  is  one-half,  apart  from  practice  improvement. 

2—2 
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We  must,  however,  next  consider  other  and  more  usual  ways  of 
indicating  scatter,  although  the  use  of  the  semi-interquartile  range  as 
a  check  is  most  valuable,  since  it  is  a  quantity  about  which  glaring  errors 
are  not  likely  to  be  made. 

(1)  There  is  first  the  "mean  variation,"  a  measure  much  in  use 
among  psychologists  but  not  one  to  be  recommended.    In  this  the 
deviation  or  variation  of  each  reading  from  the  mean  is  written  down, 
and  the  mean  of  these  found,  disregarding  their  sign.   Taking  the  same 
example  of  bisecting  a  line  we  obtain 

Sum  of  deviations  regardless  of  sign 
M.V.  =  -          -^p-  -s-  =  1-14  mms. 

Number  of  cases  n 

The  mean  variation  is  often,  as  here,  about  the  same  size  as,  or  a  little 
larger  than,  the  semi-interquartile  range,  which  latter,  it  will  be  seen, 
is  indeed  the  median  variation,  from  the  median,  though  this  name  is 
not  used. 

(2)  The  "standard  deviation,"  generally  denoted  by  cr,  is  by  far 
the  best  measure  of  scatter  to  use,  for  reasons  which  will  gradually 
become  clear  in  the  process  of  studying  the  subject.    Its  long  title  is 
the  "root  mean  square  deviation"  and  it  is  obtained  by  squaring  each 
deviation,  adding  the  squares  all  together,  dividing  by  the  number  of 
readings  and  taking  the  square  root: 

ySum  of  squares  of  deviations 
Number  of  cases  n 

If  this  process  be  carried  out  the  value  o-  =  1  -38  will  be  found  for  our 
example. 

This  is  admittedly  a  longer  process  than  finding  the  mean  variation. 
It  can  be  simplified  by  the  following  device,  which  should  be  used 
whenever  the  mean  of  the  readings  is  a  number  involving  awkward 
decimals.  Instead  of  taking  the  deviations  from  the  real  mean,  here 
60-13  mms.,  let  them  be  taken  from  some  convenient  point,  here  say 
60  mms.,  chosen  by  mere  inspection.  Proceed  now  exactly  as  before, 
squaring  and  adding  the  deviations.  But  from  the  mean  of  the  squared 
deviations  must  be  deducted  the  square  of  the  distance  between  the 
real  mean  and  the  convenient  point  from  which  the  deviations  have 
been  taken  (see  example  opposite). 

The  proof  that  this  then  gives  the  mean  of  the  squares  of  the  real 
deviations  is  easily  obtainable  by  elementary  algebra. 

This  shows  one  very  important  property  of  the  mean,  namely,  that 
it  is  the  point  where  the  sum  of  the  squares  of  the  deviations  is  a 
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minimum:  for  the  provisional  mean  square  is  always  greater  than  the 
real  mean  square,  since  the  correction  subtracted  is  essentially  positive. 


Bisection  data  (page  15) 

Deviations  from 

60  mms.        Squares  of  preceding 

3-8 

14-44 

2-2 

4-84 

2-1 

4-41 

1-4 

1-96 

1-3 

1-69 

1-2 

1-44 

1-2 

1-44 

1-1 

1-21 

1-0 

1-00 

0-9 

0-81 

0-8 

0-64 

0-6 

0-36 

0-4 

0-16 

0-3 

0-09 

0-0 

0-00 

0-1 

0-01 

0-1 

0-01 

0-4 

0-16 

0-5 

0-25 

0-8 

0-64 

0-8 

0-64 

0-9 

0-81 

1-0 

1-00 

•2 

1-44 

•3 

1-69 

•4 

1-96 

•8 

3-24 

•9 

3-61 

2-4 

5-76 

29  )  55-71 

1-92  provisional  mean  square 

Subtract  0-132=  0-02 

1-90  real  mean  square 

In  this  section  we  have,  throughout,  taken  the  mean  as  the  central 
value  from  which  the  scatter  was  to  be  measured.  In  the  great  majority 
of  cases  this  is  the  practical  plan;  but  if  necessary,  measures  of  scatter 
from  some  other  central  value  could  be  used.  Indeed  as  has  been 
pointed  out,  the  semi-interquartile  range  is  the  median  of  the  deviations 
from  the  median. 

If  the  distribution  of  readings  is  represented  by  a  smooth  curve  as 
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in  the  diagram  on  p.  17,  then  it  will  be  seen  that  the  mean  square 
deviation  about  the  origin  is  given  by  the  expression 


so  that  we  have  the  equation,  symbolising  the  above  calculation, 


the  integrals  being  as  before  over  the  whole  of  the  distribution. 

In  practical  work,  instead  of  actually  giving  the  standard  deviation, 
it  is  more  usual  to  quote  a  quantity  called  the  "probable  error,"  which 
may,  for  the  present,  be  arbitrarily  defined  as  equal  to  -67449cr,  i.e.  it  is 
an  arbitrary  reduction  of  a.  To  the  meaning  of  this  reduction  we  shall 
return  presently. 

The  Standard  Deviation  about  the  True  Value 

In  the  section  above  we  defined  a  quantity  known  as  the  Standard 
Deviation  about  the  Mean.  From  another  point  of  view,  however,  we 
sometimes  require  the  value  of  the  Standard  Deviation  about  the  True 
Value  of  the  quantity  which  is  being  measured.  This  will  always  be  a 
little  larger  than  the  former  quantity,  unless  the  true  value  happens  to 
coincide  exactly  with  the  Mean.  This  follows  from  the  theorem  that 
the  sum  of  the  squares  of  the  deviations  is  a  minimum  at  the  Mean. 

Of  course  if  we  do  not  know  the  true  value  of  what  we  are  measuring, 
we  shall  be  unable  to  find  this  quantity  exactly,  but  it  can  be  shown 
that  it  is  approximated  to  if  we  divide  by  n  —  1  instead  of  n  after 
finding  the  sum  of  the  squares  of  the  deviations. 

Let  a  be  standard  deviation  about  mean  a,  and  a  standard  deviation 
about  true  value  A  . 

Let  alt  a2,  a3,  ...  an  be  the  readings. 

Let  elt  e2)  e3,  ...  en  be  the  true  errors  of  those  readings,  and  let  e  be 
the  error  of  the  mean  a. 

Then  a  -  a1  =  A  +  e  -  (A  +  e^  =  e  ~  elt 

a  —  a2  =  e  —  e2,  etc. 
Therefore 

^  =  S(a-  ak)2  =  S(e-  ek)2  =  ne2  -  2eS  (ek)  +  S  (ek2) 
n  n  n 

Now  no'2  =  S  (e2) 

and  ne  =  S  (ek), 
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so  that  we  have 

„•  =  a'*  +  <?  -  2e8(e^  -  a"  +  ^  -  2S*(e*} 
n  n2  n2 

=  a,2  _  S2  fa)  =  ^2  _!?M)  _  2S  fa*m) 

w2  w2  w2 

The  last  term  is  approximately  zero,  the  positive  errors  cancelling  the 
negative. 

We  have  finally 


(7  = 

It  will  be  seen  that  when  n  is  sufficiently  large  the  two  quantities 
become  practically  identical.  It  is  when  n  is  small  that  the  correction 
becomes  of  importance.  This  is  especially  seen  from  a  consideration  of 
an  extreme  case,  namely  where  only  one  measurement  is  made. 

Let  us  say  that  the  one  measurement  made  has  the  value  v.  Then 
the  mean  has  also  the  value  v,  the  deviation  is  zero,  and  the  standard 
deviation  is  therefore  the  square  root  of  zero  divided  by  unity.  That  is, 
the  standard  deviation  about  the  mean  is  zero.  The  standard  deviation 
about  the  true  value  is,  however,  by  the  above  rule,  indeterminate, 
being  the  square  root  of  zero  divided  by  zero. 

The  reciprocal  of  the  standard  deviation  is  frequently  used  as  a 
measure  of  the  accuracy  of  a  set  of  observations.  It  is  clear  from  the 
above  that  if  the  number  of  observations  is  small  it  is  the  standard 
deviation  about  the  true  value  which  must  be  used,  that  is  we  must 
employ  n  —  1  in  the  denominator  instead  of  n.  For  otherwise  the 
accuracy  of  a  single  measurement  would  be  infinite. 

The  Standard  Deviation  of  the  Arithmetical  Mean 

In  a  section  above  it  was  shown  that  the  mean  square  of  the  devia- 
tions of  a  set  of  readings  is  a  minimum  at  the  arithmetical  mean.  About 
any  other  value  distant  e  from  the  mean  this  mean  square  is  increased 
bye2. 

Now  the  mean  square  deviation  about  the  mean  is  a2.  But  the  mean 
square  deviation  about  the  true  value  is  n<j2/(n  —  1).  Therefore  the 
expected  square  deviation  of  the  mean  from  the  true  value  is  given  by 
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The  expected  square  deviation  (before  experience)  is,  however,  the  same 
thing  as  the  mean  square  deviation  (after  experience),  so  that  the 
standard  deviation  of  the  mean  about  the  true  value  is  therefore 


V(n  -  1) 
and  that  about  the  mean  of  the  means  will  be 

(T/^/n. 

The  standard  deviation  of  a  mean  is  therefore  obtained  by  dividing  the 
standard  deviation  of  the  whole  distribution  by  the  square  root  of  the 
number  of  readings. 

The  Standard  Deviation  of  Sum  or  Difference 

Let  2  be  a  quantity  whose  standard  deviation  is  crx,  and  y  a  quantity 
whose  standard  deviation  is  ay\  required  the  standard  deviation  of  the 
sum  x  +  y.  Let  mx  be  the  mean  value  of  x  and  my  the  mean  value  of  y. 
Then  any  single  value  of  the  sum  x  +  y  will  be  of  the  form 

MX  +  8*  +  mv  +  &y 

Moreover  the  mean  of  the  sum  x  +  y  is  equal  to  mx  +  my,  so  that  the 
deviation  of  the  above  reading  is 

3,  +  3,. 
The  mean  square  deviation  of  the  sum  x  +  y  is  therefore 

o\Hr  =  S  (8,  +  Sy)*/n  =  8  (ox*)/n  +  S  (8x3v)/n  +  8  (8y*)/n. 
The  quantity  S  (8X8V)  however  will  be  very  small  if  not  zero,  because 
the  factors  Sx  and  8V  are  not  connected  with  one  another  in  any  way, 
and  their  products  are  as  likely  to  be  positive  as  negative,  so  that  the 
positive  values  will  annul  the  negative  on  the  average.  We  have, 
therefore,  finally: 


Exactly  the  same  reasoning  holds  for  the  value  of  cra_v,  the  only 
change  being  in  the  sign  of  my  and  3V,  and  therefore  of  S  (SaA),  which 
is  however  zero.  The  final  result  is  the  same 

"«-,  =  VW  +  <v>). 

It  is  assumed  in  both  these  formulae  that  x  and  y  are  independent  and 
uncorrelated. 
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(4)  THE  FUNDAMENTAL  THEOREM  IN  PROBABILITY 

In  ordinary  usage,  when  we  say  that  an  event  is  probable  under 
certain  circumstances,  we  mean  that  it  is  more  likely  to  happen  than 
not  to  happen,  and  by  improbable  we  mean  that  it  is  more  likely  not  to 
happen  than  to  happen. 

If  we  cross-question  ourselves  as  to  why  we  think  an  event  is  probable 
we  often  find  that  it  is  because  we  have  more  frequently  found  it  to 
happen  than  to  fail  under  similar  circumstances  in  the  past. 

If  we  wish  to  apply  mathematical  treatment  to  probability  we  must 
decide  on  a  quantitative  measure  for  it.  We  do  so  by  using  a  fraction 
(vulgar  or  decimal)  for  this  purpose,  in  such  a  way  that  the  fraction 
rises  and  falls  with  the  probability,  becoming  unity  for  "certain  to 
happen"  and  zero  for  "certain  not  to  happen."  The  numerator  of  this 
fraction  is  the  number  of  equally  probable  ways  in  which  the  event  can 
happen,  while  the  denominator  is  the  total  number  of  equally  probable 
ways  in  which  the  event  can,  under  the  given  circumstances,  either 
happen  or  fail*. 

Thus,  for  example,  consider  the  probability  that  with  one  throw  of 
a  six-faced  die  a  score  of  more  than  four  will  be  obtained.  This  can 
happen  in  two  ways,  namely,  by  throwing  a  five  or  a  six.  The  total 
number  of  possible  throws  is  six,  and  therefore  the  probability  is  f  or  J. 

This  method  of  giving  a  quantitative  value  to  a  probability  is  clearly 
connected  with  the  method  adopted  in  betting.  For  instance,  odds  of 
"3  to  1  against"  a  certain  event  means  that  the  speaker  judges  that 
there  is  only  one  chance  of  success  to  three  chances  of  failure.  The 
fraction  representing  the  probability  of  success  is  therefore 

1       _1 
3+l~4* 

If  the  experiment  with  dice  mentioned  above  be  actually  performed 
a  large  number  of  times,  it  will  be  found  that  the  number  of  occasions 
on  which  the  score  exceeds  4  will  closely  approximate  to  one-third  of 
the  whole.  If  therefore  we  had  never  seen  dice  and  had  no  idea  of  their 
appearance,  but  were  told  that  a  large  number  of  throws  always  included 
about  one-third  which  were  over  fourf,  we  should  conclude  that  the 
probability  of  obtaining  a  throw  of  more  than  four  was  about  one-third. 

*  Cf.  however  Jeffries  and  Wrinch,  Phil.  Mag.  1919. 

t  The  late  Professor  Weldon  found  106602/(  12  x  26306)  =0-3377.    See  Phil.  Mag. 
June  1900,  article  by  Professor  K.  Pearson. 
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This  method  of  finding  probabilities,  by  deducing  them  from  a  large 
number  of  actual  experiments,  is  that  followed  most  frequently  in 
practice. 

For  example  if  the  points  of  an  aesthesiometer  are  applied  to  a 
subject's  forearm,  with  the  points  distant  3  cms.  from  one  another, 
the  average  subject  will  usually  recognise  that  two  points  are  present. 
Occasionally  however  he  will  only  feel  one  point.  What  is  the  proba- 
bility that  he  will  answer  two?  This  can  only  be  decided  by  experiment. 
In  an  actual  case,  150  trials  were  made  at  a  number  of  different  sittings. 
On  105  occasions  the  answer  two  was  returned.  If  the  conditions  have 
been  the  same  throughout  then  the  probability  of  an  answer  two 
under  these  conditions  is  |f§  =  0-7. 

If  the  probability  of  an  event  occurring  is  p  then  the  probability  of 
it  not  occurring  is  clearly  1  —  p,  for  which  q  is  often  written.  For  the 
event  is  certain  to  happen  or  not  to  happen,  and  therefore  the  two 
probabilities  must  add  up  to  "certainty"  that  is  to  unity.  Thus  in  the 
above  case  the  probability  of  an  answer  two  not  being  returned  is  0-3. 
Similarly  the  chance  of  throwing  an  ace  at  dice  is  J  and  of  not  throwing 
an  ace  is  f .  The  chance  of  obtaining  a  head  on  throwing  a  coin  is  J, 
and  the  chance  of  not  obtaining  a  head  is  J. 

Next  consider  the  probability  of  an  event  happening  twice  in  suc- 
cession, if  its  probability  is  p  for  one  occurrence.  Take  a  few  specific 
cases  first.  If  two  coins  are  thrown  or  one  coin  thrown  twice,  there  are 
four  equally  likely  things  than  can  happen,  namely : 

bead  head 

tail  tail 

head  tail 

tail  head 

The  probability  of  getting  two  heads  is  therefore  J  which,  it  will  be 
noticed,  is  equal  to  (£)2. 

Next  consider  two  throws  of  a  pix-faced  die.  There  are  here  no  less 
than  36  things  which  can  happen,  for  whatever  value  the  first  throw 
has  there  are  six  different  throws  of  the  second  die  which  can  be  associated 
with  it:  and  since  the  first  can  also  have  six  values  there  are  6  x  6  or 
36  combinations  possible.  Only  one  of  these  combinations  consists  of 
two  aces,  so  that  the  chance  of  throwing  two  aces  is  -fa  or  ( J)2.  Generally, 
if  the  chance  of  an  event  is  p  then  the  chance  of  it  occurring  twice  in 
succession  is  p2.  For  suppose  there  are  m  ways  in  which  it  can  happen, 
and  n  ways  in  which  it  can  fail,  then  p  =  m/(m  4-  n).  Also,  having 
happened  once,  there  are  m  ways  in  which  the  second  success  can  occur, 
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each  of  which  ways  might  be  associated  with  the  first  success ;  and  since 
there  are  also  m  ways  of  this  first  success  happening  there  are  in  all 
m2  ways  of  the  double  success  happening.  Similarly  there  are  in  all 
(m  +  n)2  ways  of  the  double  event  either  happening  or  failing,  and 
therefore  the  chance  of  a  double  success  is 

m2        _    2 
(m  +  rif  ~  P  ' 

Reasoning  exactly  similar  to  the  above  will  enable  the  reader  to 
convince  himself  of  the  truth  of  the  following  general  theorem: 

If  there  are  a  number  of  independent  events  a,  6,  c,  ...  etc.  and  their 
respective  probabilities  of  occurrence  are  pa,  p^,  pc,  ...  etc.,  then  the 
probability  of  all  occurring  is  the  product 

Pa  x  p*  x  pc  x  .... 

This  may  be  said  to  be  the  fundamental  proposition  in  probability.  Its 
use  will  be  recognised  best  by  an  example.  It  will  be  instructive  to  take 
a  real  psychometric  experiment,  that  already  described  in  the  experi- 
ment on  weight-lifting  on  p.  13.  The  subject  of  those  experiments  com- 
pared each  of  the  weights  with  the  standard  450  times.  The  results  of 
this  were  as  follows* : 

Grams  84        88        92          96        100        104        108 

No.  of  answers  heavier          0        11         32         100        212        402        431 

From  these  the  probability  of  the  subject  answering  "heavier"  at  any 
weight  can  be  calculated,  always  assuming  that  the  conditions  have 
throughout  remained  the  same,  by  dividing  the  above  numbers  by  450. 
We  thus  obtain  the  following: 

Grams  84  88  92  96  100  104  108 

Probability  of 


,     „,    0-0000       0-0244       0-0711       0-2222       0-4711       0-8933       0-9578 
answer  "heavier    ) 

Let  us  now  apply  our  theorem  on  the  combination  of  probabilities 
to  solve  the  following  problem : 

Let  the  weights  84,  88  grams  etc.  be  each  presented  once  to  the  subject. 
What  is  the  probability  that  the  lowest  answer  "heavier"  will  be  at  96  grams? 

The  probability  that  he  will  answer  "heavier"  at  96  grams  is  0-2222. 
The  probability  that  he  will  not  answer  "heavier"  at  92  grams  is 
1  -  0-0711  =  0-9289.  Similarly  the  probability  that  he  will  not  answer 
"heavier"  at  88  grams  is  0-9756,  and  at  84  grams  is  1-0000.  The  proba- 

*  Urban,  op.  cit.  p.  287,  Table  XI  (multiply  the  values  by  450).  Or  alternatively 
consult  Table  V,  p.  175,  of  The  Application  of  Statistical  Methods  to  the  Problems  of 
Psychophysics,  by  F.  M.  Urban,  Philadelphia,  1908. 
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bility  of  the  combination  happening,  namely,  an  answer  "heavier"  at 
96  and  none  below,  is,  therefore 

1-0000  x  0-9756  x  0-9289  x  0-2222  =  0-2014. 
The  actual  frequency  with  which  this  occurred  was  0-2125. 

(5)  IMPORTANCE  OF  THE  BINOMIAL  EXPANSION  IN  THE 
THEORY  OF  PROBABILITY 

By  a  not  unnatural  hypothesis,  which  has  been  widely  accepted  and 
has  proved  very  fruitful,  an  error  of  measurement,  such  as  that  made 
in  judging  the  half-way  point  in  a  line  in  an  experiment  above,  may  be 
looked  upon  as  the  resultant  of  a  large  number  of  small  circumstances, 
each  of  which  sometimes  sways  our  measurement  in  the  one  direction, 
sometimes  in  the  other.  This  hypothesis,  combined  with  the  fundamental 
theorem  of  probability  just  explained,  leads  to  the  use  of  the  binomial 
expansion  in  describing  distributions  of  error.  This  can  be  illustrated 
by  the  following  example: — Let  us  suppose  that  a  quantity  which  we 
desire  to  measure  has  really  the  value  13  J  units,  but  that  we  are  opposed 
in  our  efforts  to  measure  it  by  seven  "Djinns,"  each  of  whom  has  the 
power  of  displacing  our  measurement  by  one  half-unit.  Let  us  further 
imagine  that  each  of  these  mischievous  imps,  in  an  endeavour  to  prevent 
our  making  any  steady  measurement,  decides  that  he  will  add  or  deduct 
his  half-unit  according  to  the  throw,  heads  or  tails,  of  a  coin.  Whenever 
we  try  to  make  a  measurement,  therefore,  these  invisible  seven  will 
assemble  and  throw  each  his  coin  in  the  air.  If  all  the  coins  happen  to 
come  heads,  seven  half-units  are  cunningly  added  to  the  13J,  and  we 
obtain  a  measurement  of  17.  If  all  come  tails,  we  get  13  J  minus  3J, 
or  10.  If  on  another  occasion  five  are  heads  and  two  are  tails,  five  half- 
units  will  be  added  and  two  subtracted,  giving 

13J  +  f  -  f  =  15. 

An  actual  test  of  this  is  given  in  the  following  figures  and  diagram. 
Seven  coins  were  thrown  on  128  occasions,  and  each  time  the  proper 
number  of  half-units  was  added  (for  the  heads)  or  subtracted  (for  the 
tails)  from  13 J.  The  result  was  as  follows: 

0  heads  and  7  tails  occurred    2  times  giving  a  value  10  units 


1      „       ,,6 

8     „ 

„      11 

2      „       ,,5 

16     „ 

,       12 

3      „       ,,4 

38     , 

13 

40 
»          >»      •* 

43     „ 

,       14 

5      „       ,,2 

16    „ 

,      15 

6      „       ,,1 

2     „ 

,      16 

7      „       ,,0 

3    „ 

,       17 

Total  128 

This  distribution  is  shown  in  Fig.  5. 
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The  general  resemblance  between  the  diagram,  made  by  throwing 
coins,  and  previous  diagrams  which  represent  the  result  of  psychological 
experiments  is  not  surprising  if  we  consider  for  a  moment  what  the 
condition  of  such  experiments  are.  The  "Djinns"  which  oppose  our 
efforts  to  obtain  a  true  value  for  say  the  spatial  threshold  are  in- 
numerable. Some  are  in  the  fingers  of  the  experimenter,  and  make  him 
press  irregularly  on  the  aesthesiometer  points.  Others  cause  noises  to 
happen  in  the  neighbourhood  to  distract  the  subject's  attention.  Other 
Djinns  make  the  instrument  hot  one  day  and  cold  the  next,  others  live 
in  the  subject's  skin,  and  quite  a  lot  are  engaged  in  stirring  up  vivid 
imaginations  in  his  mind  so  that  he  feels  all  kinds  of  prickles  and 
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Fig.  5.   Number  of  heads  in  a  throw  of  seven  coins  in  128  repetitions 

tinglings  which  make  his  judgments  on  the  position  of  the  points  as 
erratic  at  times  as  is  the  throw  of  a  coin. 

In  the  figure,  there  is  shown,  in  addition  to  the  polygon  based  on 
experiment,  a  dotted  theoretical  polygon  to  which  we  now  turn  our 
attention.  Since  the  probability  of  obtaining  a  head  at  one  throw  is  J, 
the  probability  of  obtaining  seven  heads  in  a  throw  of  seven  coins  is 

(*)7  or  tfe- 

The  probability  that  a  certain  coin  will  give  a  tail  but  all  the  others 

heads,  is  (J)6  x  (1  —  J)  which  also  equals  yj^.  As  there  are  seven  coins 
in  all,  each  of  which  might  give  the  only  tail,  the  total  chance  of 
obtaining  six  heads  and  one  tail  is  yf^.  The  probability  that  two 
specified  coins  out  of  the  seven  will  give  tails,  the  other  five  giving  heads, 
is  (I)5  x  (1  —  J)2  =  T^.  The  number  of  ways  in  which  the  two  can  be 
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specified  is  21.  The  total  chance  of  obtaining  five  heads  and  two  tails  is, 
therefore,  21/128,  obtained  thus, 

7!     /1\5  /1\5     _21_ 
5!  2!  \2J  \2j  '"128' 

It  will  be  seen  that  the  above  probabilities  of  obtaining  seven,  six  or  five 
heads  are  the  first  three  terms  in  the  expansion  of  (J  +  J)7.  The  next 
term  will  similarly  be  found  to  be  the  probability  of  obtaining  four 
heads,  and  so  on.  We  thus  obtain  the  following  results : 


of  7  heads      1  in  128 

6 

7  „  „ 

5 

21  „  „ 

4 

35  „  „ 

3 

35  „  „ 

2 

21  „  „ 

1  head       7  „  „ 

no  heads     1  „  „ 

128 

It  is  these  numbers  which  are  shown  by  the  dotted  line  in  the  figure. 

In  general,  if  p  is  the  probability  of  an  event  succeeding,  and  q  of 
it  not  succeeding,  the  respective  chances  of  it  succeeding 

k,  k-  1,  k  -2,  ...3,  2,  1  or  0 

times  in  k  trials  are  given  by  the  terms  of  the  expansion  of  (p  -f  q)k. 

In  n  groups  of  k  trials  therefore  the  most  probable  numbers  of 
successes  in  the  various  groups  are 

k(k-l) 
<  +  kpk-1q  +     \ 

i .  & 

For  example  let  the  event  be  throwing  either  an  ace  or  a  six  with  a 
six-faced  die,  let  six  trials  be  made  in  each  group,  and  let  a  thousand 
groups  be  tried.  Then  the  above  expansion  is 


Therefore  the  number  of  times  a  throw  consisting  of  only  "aces"  or 
"sixes"  will  occur  in  1000  trials  with  six  dice  each  time  is  most  probably 

1000      1000 

— __  =  or  once,  to  take  the  nearest  integer.    In  the  same  way  we 

o  t  Zu 

get  this  table: 
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1000  throws 

of  six  dice 

each  time 

1000 
All     6  are    Aces  or  Sixes    on        "796"  or 

approx.      1 

occasion 

12000 
729     " 

„          16  occasions 

60000 

QO 

»>        »      »>       »>       f> 

729     " 

•»           o* 

M 

3  „        »»      »>       »»       „ 

160000 

220 

729      " 

2  „        „      „       „       „ 

240000 

329 

.  .                .  /—  . 

»>             1 

729      " 

,,     1  is  an  ace  or  six           „ 

192000 
729~  " 

„        264 

M 

None  are  aces  or  sixes           „ 

64000 
729     " 

88 

>» 

1000 

(Mode) 


Here  the  mode  is  at  two. 

What  is  the  mean11.  In  calculating  it  we  shall  take  the  exact  fractions 
in  the  above  table,  not  the  approximate  integers.  The  mean  is  then 
obtained  thus: 

6x     10004-729=     60004-729 

5x    120004-729=  600004-729 

4x   600004-729=2400004-729 

3x1600004-729=4800004-729 

2  x  240000  4- 729  =  480000  4- 729 

1  x  192000  4-  729  =  192000  4-  729 

Ox   640004- 729=nil 

Here,  therefore,  the  mean  coincides  with  the  mode. 
In  general  we  have  from  the  binomial  expansion 


kpk~lq 


Mean  equals 


1.2 


(k  -  2) 


1.2 


1.2 

=  kp  (p  4-  qf~l  =  kp  since  p  +  q  =  1. 

The  mode  can  also  be  deduced  from  the  expansion.  The  first  term  in 
the  expansion,  p,  corresponds  to  k  successes,  the  second  term  to  k  —  1 
successes,  and  so  on,  so  that  the  term  corresponding  to  any  number  m 


of  successes  is 


k\ 


m !  (k  —  m)  I 
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This  term  is  made  from  the  preceding  term  by  multiplying  the  latter  by 

m  +  1    q 
k  —  m  p 
and  the  succeeding  term  is  made  from  it  by  multiplying  by 

m  q 

k—  m+  1    p' 
It  will  therefore  be  the  largest  term  if 

W+M>1>         m         .i 
k  —  m  p  k  —  m  +\'  p* 

whence  qm  +  q  >  pk  —  pm,    pk  —  pm  +  p  >  qm, 

qm  +  pm  >  kp  —  q,     kp  +  p  >  qm  +  pm, 

m>  kp  —  q,     kp  +  p  >  m, 
so  that  kp  —  q  <  m  <  kp  -f-  p. 

The  greatest  term,  therefore,  is  that  which  corresponds  to  a  number 
of  successes  between  kp  —  q  and  kp  -\-  p.  The  range  thus  indicated  is 
unity  since  p  +  q  =  1.  In  the  binomial  expansion  therefore  the  mean 
and  the  mode  agree  to  an  integer. 

Standard  Deviation  of  the  Binomial  Expansion 

For  the  binomial  expansion  the  standard  deviation  is  equal  toi/(kpq). 
This  can  be  proved  as  follows: 

In  the  expansion  of  (p  +  q)k  the  first  term  pk  represents  the  proba- 
bility of  the  specified  event  succeeding  k  times.  The  mean  number  of 
times  it  succeeds  is  kp,  so  that  the  deviation  is  k  —  kp  and  the  first 
term  in  a2  is  pk  x  (k  —  kp)2.  Similarly  the  second  term  kp*-1  q  of  the 
binomial  represents  the  probability  of  (k  —  1)  successes,  and  the  devia- 
tion is  therefore  (k  —  1)  —  kp,  so  that  the  second  term  in  cr2  is 

kpk~l  q(k-l-  kp). 
Remembering  that  k  —  kp  =  kq  we  get  the  following  expansion  for  cr2 


*V  (p  +  qY  ~  2*V  (p  +  q?-1  +    ^-1  q  +  4 


+  kq  {p"-1  +  2(k-l)  pk~2  q  +  ...} 
lcq  {(p 


kq-  q) 
kq(l-q)  =  kpq,         a 
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(6)  THE  NORMAL  CURVE  OF  ERROR 

We  have  seen  that  a  binomial  expansion  gives  a  cocked  hat  figure 
very  like  the  actual  diagrams  obtained  by  experiment.  In  the  imaginary 
example  which  we  considered,  seven  Djinns  added  or  subtracted  each 
a  half  unit  to  or  from  the  quantity  13  \  which  we  were  trying  to  measure. 
We  then  obtained  measurements  extending  from  10  to  17  units,  but 
always  at  the  exact  units. 

In  practice  if  we  were  handling  a  not  easily  measurable  quantity 
we  might  well  find  our  measurements  range  over  this  distance,  but 
unless  they  were  constrained  to  do  so  by  some  peculiarity  of  the  method, 
they  would  not  always  occur  at  exact  units,  but  at  any  distance.  This 
case  is  covered  if  we  imagine  the  number  of  Djinns  (the  factors  of 
accidental  error)  to  be  much  increased  but  the  influence  of  each  one 
made  less:  thus  there  might  be  7000  Djinns  each  adding  or  subtracting 
a  mere  fraction  of  a  unit,  or  ultimately  an  infinite  number  of  them,  each 
adding  or  subtracting  an  infinitesimal  amount,  just  as  an  infinite  number 
of  the  tiniest  errors  (we  may  well  imagine)  account  for  the  variations 
in  our  experimental  readings. 

What  would  the  binomial  expansion  then  become?  That  is,  what 
form  does  the  expansion  of  (  J  +  J)fc  take  when  Jc  becomes  infinite  and 
the  terms  are  not  unit  distance  apart  but  only  an  infinitesimal  distance 
dx\  The  whole  range  covered  by  the  expansion  is  kdx,  and  the  term  at 
either  end  which  occurs  when  all  the  errors  are  either  positive  or  negative, 

is  at  a  distance  Jc  -^-,  from  the  middle,  so  that  each  elementary  error  is 

2i 

now  dx/2  just  as  formerly  it  was  half  a  unit. 

The  term  which  corresponds  to  I  errors  being  positive  and  k  —  I 
negative  is  p_  (J)I  (J)»-,  i!/{,,  (t  _  Q,}  ......  (1)> 

and  the  net  error,  the  abscissa  of  this  point,  is 

x  =  {l-(k-l)}dx/2  ......  (2). 

The  next  point,  distant  dx,  is  that  corresponding  to  I  +  1  positive  and 
Jc  —  I  —  1  negative  errors,  giving  an  abscissa  of 

{I  +  i  _  (k  _  i  _  i 
Its  probability  is 


The  ratio  of  these  is 

P  +  dP      k  -  I  dP      .   dP  _  k  -  21  -  1 

P          l+l          ~  P'    "   P  =        l+l 

B.  &  T. 
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But  I  =  -^-  +  5 ,  from  eqn.  (2)  for  x  above,  so  that 
dx      2 

dP_         2  (2x  +  dx) 

Now  we  are  going  to  make  k,  the  number  of  atomic  errors,  equal  to 
infinity,  and  we  can  therefore  neglect  the  2  in  k  -f  2  and  write 

J  ]O  O    /O/¥»       |        sj r*\  A  /¥•  O/7/Y* 

az^  Z  (ZS  -+•  »S;  _  4:S  adX 

P  2x  -f  A:^s  2s  +  &<fo      2s  +  kdx 

Therefore  -=— ,  which  is  the  quantity  we  require  before  we  can  integrate 

and  obtain  the  equation  to  the  continuous  curve  to  replace  the  binomial 
cocked  hat,  is  given  by 

~  P  dx  =  2xdx  +  kdx2  +  2x  +  kdx  (4)' 

We  must  now  consider  the  quantity  kdx  which  gives  the  entire 
extent  of  scatter,  the  whole  range.  The  number  of  errors  k  is  infinite, 
we  have  assumed,  and  dx  is  infinitesimal.  The  range  kdx  may  then 
either  be  finite  or  infinite.  If  we  assume  it  to  be  finite,  then  kdx2  will 
be  infinitesimal  and  the  above  equation  becomes 
1  dP  4x  2 


when  dx  becomes  infinitesimal.  Or  -v-  =  —  infinity,  except  when  P  =  0. 

In  this  case  the  probability  falls  off  infinitely  quickly  from  the  mean, 
and  this  is  therefore  a  case  of  no  scatter  at  all.  If  therefore  we  postulate 
an  infinite  number  of  elementary  errors,  we  must  allow  a  possible  range 
of  scatter  of  infinite  extent,  the  only  alternative  being  no  scatter  at  all. 
But  although  the  possible  range  is  infinite,  it  will  be  found  that  at 
infinity  the  probability  is  infinitesimal,  that  is  the  larger  errors  do  not 
in  practice  occur. 

We  take  therefore         7  , 

kdx  =  oo , 

k  (dx)2  =  a  finite  quantity*, 

which  we  shall  write  =  do2. 

(a  will  turn  out  presently  to  be  our  previous  acquaintance  the  stan- 
dard deviation.)  We  then  have  from  eqn.  (4) 

j.^P_         4s  x_ 

P  dx~  2xdx  +  402  ~~  o2 

*  Here  again  the  alternative  ought  to  be  investigated. 
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and  on  integrating, 

x2 
log  P  =  —  s~   +  constant, 


or  P  =  Ce 

This  equation  gives  the  probability  P  of  any  value  of  the  error  x 
occurring,  and  not  only  of  any  integral  value,  as  the  binomial  does. 
The  value  of  the  constant  C  will  be  found  presently,  and  it  will  also 
be  shown  that  a  is,  as  has  been  already  asserted,  the  same  standard 
deviation  which  we  already  know  in  another  guise.  The  general  shape 
of  such  Normal  Curves  is  shown  by  the  example  in  Fig.  6,  p.  43. 

Some  properties  of  the  Normal  Curve 

In  the  curve  P  —  Ce  ~  xZ/(2ff2\  p  is  the  probability  of  the  occurrence 
of  an  error  x.  Let  us  consider  of  what  order  of  magnitude  the  quantities 
P  are.  In  the  first  place  we  remember  that  the  curve  was  found  as  the 
limit  of  the  expansion  (J  +  J)fc,  when  k  was  made  infinite.  This  curve 
however  flattens  out  more  and  more  as  k  is  increased. 

For  example 


So  that  clearly  when  k  becomes  infinite,  the  ordinates  of  the  curve,  as 
we  have  so  far  considered  it,  flatten  out  so  that  they  are  all  infinitesimal. 
In  other  words  the  probability  P  of  any  exact  value  of  x  occurring  is 
really  infinitesimal  though  it  varies  from  one  x  to  another  x.  C  must, 
therefore,  be  infinitesimal.  Let  us  try  putting 

C  =  C'dx, 

so  that  if  C  is  of  the  same  order  of  smallness  as  dx,  the  new  quantity  C' 
will  be  &  finite  constant.  We  have  then 

P=C'e-x*^dx  ......  (7). 

This  means,  in  geometrical  language,  that  instead  of  using  a  curve  whose 
ordinate  P  measures  the  probability  of  the  occurrence  of  x,  we  had 
better  use  a  curve  where  this  probability  is  measured  by  the  area  of  an 
elemental  rectangle  contained  by  two  ordinates  enclosing  x,  and  distant 
dx  from  one  another  (cf.  Fig.  4,  p.  17).  Such  a  curve  will  be  similar  in 
shape  to  the  former  but  of  finite  ordinates. 

We  have  assumed  in  the  above  argument  that  C'  as  there  defined 

3—2 
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is  a  finite  quantity,  that  is  that  C  is  of  the  same  degree  of  smallness 
as  dx.  That  this  is  so  may  be  shown  by  proceeding  next  to  find  the 
actual  value  of  C'  in  the  following  manner. 

It  is  clear  that  the  sum  of  the  probabilities  of  all  errors  must  equal 
unity,  for  at  any  one  measurement  it  is  certain  that  some  error  or  other 
must  occur,  if  we  include  zero  error  in  the  mathematical  sense.  That  is 


Substitute  for  P  from  the  equation  (7)  above  and  replace  the  sign  of 
summation  by  integration.   We  thus  obtain 


-xdx=I  ...(8). 

Write  o^/(2a2)  =  z2,    xdx  =  2o*zdz; 

then  the  integral  becomes 


Whence,  since  this  equals  unity,  we  have 

C'  =  l/((7  V2  V"),  a  finite  quantity  ......  (10). 

So  that  we  can  now  write 

p  -        dx        e-*2/(2<r2)  (H\ 

•*•    -  //o    \  c  ......  l*r*7" 

•     a  v  (ZTT) 

The  ordinate  of  the  curve  defined  on  p.  35  is  y  =  P/dx  so  that  its 
equation  is 

......  (12). 


Such  a  curve  is  called  a  probability  curve.    The  probability  of  the 
occurrence  of  the  value  x  is  given  by  yxdx  and  the  probability  of  x  falling 

between  a  and  6  is  given  by       ydx.    The  total  area  I       ydx  equals 

J  b  J  -co 

unity. 

If  N  measurements  were  made,  and  the  errors  were  distributed 
according  to  such  a  curve,  then  the  most  probable  number  of  times  a 
deviation  x  occurred  would  be  NPX.  A  curve 

N 


is  similar  to  a  probability  curve,  but  each  ordinate  is  N  times  as  tall. 

*  The  reference  is  to  Integral  B  in  Appendix  II  giving  the  values  of  a  number  of 
integrals  of  general  use  in  the  theory  of  probability. 
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It  is  a  distribution  curve  and  its  total  area  is  not  unity  but  N,  and  the 
integral  from  a  to  b  represents  not  the  probability  of  x  falling  between 
a  and  6,  but  the  most  probable  number  of  times  it  would  fall  in  this 
range  out  of  N  trials. 

Since  this  curve  is  symmetrical,  the  mean,  mode  and  median  coincide. 

Let  us  find  the  standard  deviation  of  such  a  set  of  measurements. 

The  sum  of  the  squares  of  the  deviations  is  given  by 


r 

J_ 


N       r         z*      o  2          2cr2z<&      NZo2      0_. 

e ~"   x  2cr2z2  x  -^-  =  — -r-  x  20*  =  No2. 


(write  z2  =  2cr2z2,     xdx  =  2o2zdz) 

Y2o^ 

\/TT 

The  meaw  square  deviation  is  NcP/N  =  cr2. 

The  rooZ  meaw  square  deviation,  or  standard  deviation,  is  therefore  cr, 
so  that  we  were  justified  in  using  this  letter  on  p.  34  when  we  wrote 

k  (dx)2  =  4C72. 

The  quantity  cr  can  be  shown  to  have  another  significance  in  the  normal 
curve,  namely  it  is  the  distance  from  the  centre  to  the  point  of  inflection 
or  point  where  the  curve  changes  from  convex  to  concave.  The  proof 
of  this  statement  is  as  follows.  At  a  point  of  inflection  of  a  curve,  the 
value  of  d2y/dx2  is  zero.  In  our  case 


If  this  is  to  equal  zero  then  x2  must  equal  or2. 

Finally,  if  we  suppose  the  area  enclosed  between  a  distribution  curve 
and  the  axis  of  x  to  be  spinning  round  the  axis  of  y,  then  the  moment 
of  inertia  is  such  that  we  can  consider  the  whole  weight  concentrated 
equally  at  the  two  points  of  inflection.  In  other  words,  o-  is  the  "radius 
of  gyration."  For  the  area  (or  weight)  of  each  vertical  elementary 
column  of  the  curve  is  ydx  and  its  distance  from  the  axis  of  gyration  is  x. 

The  moment  of  inertia  is  therefore 


See  Integral  C,  Appendix  II,  p.  202. 
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Write  x2  =  2o2z2  and  the  integral  becomes 

n* 

C*  = 


Therefore  the  N  can  be  considered  as  concentrated  at  a  distance  <r  from 

N 
the  centre,  or  ^-  at  each  point  of  inflection. 


The  Relation  between  Mean  Variation  and  Standard  Deviation, 
in  the  case  of  Normal  Distribution 

The  distribution  is  theoretically  symmetrical  and  the  mean  of  the 
positive  variation  ought  to  equal  the  mean  of  the  negative  variation, 
apart  from  sign.  We  have  therefore 

S(x) 

M    V     —          v     ' 

"  N/2' 

S  indicating  sum  from  0  to  oo  of  the  variations  x.    Remembering  that 
the  frequency  of  occurrence  of  each  x  is 


we  replace  the  sum  S  by  an  integral,  obtaining 

N 


dx. 


Nft 

Writing  x  =  \/2ag  and  using  integral  A  of  our  list  of  definite  integrals 
(indeed  this  expression  integrates)  we  obtain 


M.V.  =  -(7 

(r\/7T        Jo  V77 

The  numerical  value  of  ^  is  -8  approx. 

VT 

The  Relation  between  Probable  Error  and  Standard  Deviation, 
in  the  Case  of  Normal  Distribution 

The  probable  error  was  defined  on  p.  22  as  an  arbitrary  reduction 
of  the  standard  deviation,  viz. 

P.E.  =  -6745(7. 
In  the  case  of  Normal  Distribution  a  physical  meaning  can  be  given  to 

*  See  Integral  C  in  Appendix  II,  p.  202. 
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this  quantity.  Consider  the  numbe^of  cases  which  fall  within  the  limits 
of  the  range  ±-6745cr.  This  number  is 

iV          r6745<r 

2  x  I          e~x*/(     ^  dx 

a  V(277)  J  o 

Values  of  the  probability  integral  have  been  calculated  and  tabulated 
for  all  limits,  and  if  such  a  table  be  consulted*  it  will  be  found  that  this 

N 
quantity  has  the  value  -^ .  That  is  to  say,  the  probable  error  gives  the 

range  within  which  one-half  the  cases  may  be  expected  to  fall.  It  is 
more  instructive  to  remember  that  half  the  cases  may  be  expected  to 
fall  outside  ±  P.E. 

With  skew  distributions  this  meaning  ceases  to  hold,  and  for  these 
a  should  be  used.  Indeed  it  is  in  general  a  better  quantity  to  employ. 

(7)  ON  FITTING  A  NORMAL  CURVE  TO  DISTRIBUTION  DATA 

Consider  the  experiment  of  bisecting  a  line  of  which  the  data  are 
given  on  p.  15.  The  histogram  of  Fig.  3,  p.  16  represents  these  data, 
and  shows  the  density  with  which  these  points  occur  in  each  part 
of  the  range.  We  wish  to  replace  this  stepwise  figure  by  a  smooth 
Normal  Curve  which  will  give  us  at  each  point  of  the  range  the  theoretical 
proportionate  density  of  the  bisection  points  at  that  spot,  and  we  want 
this  Normal  Curve  to  be  the  most  probable  which  can  be  based  on  the 
given  data. 

The  equation  of  the  required  curve  is 

N 


where  N  is  the  number  of  experiments  made,  x  is  measured  in  mms., 
and  (x  —  a)  is  a  quantity  measured  from  some  central  point  of  the  data. 
In  the  theoretical  curve  (which  is  symmetrical)  a  is  the  point  where 
mean,  mode  and  median  coincide.  In  the  data  however  the  mean  is 
60*13  and  the  median  60  while  the  mode  is  unknown.  What  value  shall 
we  give  to  a  so  as  to  obtain  the  best  fitting  curve? 

To  answer  these  questions  it  is  necessary  to  consider  what  we  mean 
by  best  fitting  curve. 

The  meaning  of  "best  fitting  curve"  is  really  the  same  as  the 

meaning  attached  to  the  phrase  "most  probable  theory."   By  the  best 

theory  of  any  set  of  data  we  mean  the  theory  from  which  the  observed 

data  could  have  chanced  to  spring  with  a  greater  probability  than  would 

*  E.g.  the  first  table  in  Pearson's  Table*  for  Statisticians  and  Biometncians. 
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be  the  case  with  any  other  theory^Take  a  simple  example.  Suppose 
a  bag  is  known  to  contain  a  large  number  of  equal-sized  balls  and  nothing 
else:  but  the  colours  of  the  balls  are  entirely  unknown.  Experiments 
have  been  carried  out  to  learn  something  about  their  colours,  each 
experiment  taking  the  form  of  extracting  one  ball,  noting  its  colour, 
and  replacing  it.  Ten  such  experiments  have  been  made  and  the  results 
are  as  follows  : 

5  black  balls, 
3  white  balls, 
1  red  ball, 
1  green  ball 

What  is  the  best  theory  of  the  composition  of  the  bag? 

The  best  theory  on  the  facts  as  given,  is  that  the  bag  contains 
^ths  black  balls,  ^ths  white  balls,  ^th  red  balls,  and  T^th  green  balls. 

Suppose  however  that  another  theory  was  advanced,  namely  that 
the  bag  contained  ^ths  black  balls,  T%ths  white  balls,  and  ^th  each  of 
red,  green,  yellow  and  blue  balls.  How  should  these  two  theories  be 
compared? 

The  proper  plan  is  to  find  the  probability,  on  each  theory,  that 
ten  dips  would  result  in  what  was  actually  observed.  Then  that  theory 
is  best  for  which  this  probability  is  greatest.  Let  us  take  first  the  theory 
which  we  assert  to  be  the  best.  The  probability,  on  that  theory,  of 
drawing  five  black,  three  white,  one  red  and  one  green  ball  (the  order 
being  immaterial)  is 

1  .1.       10!         -0-042525 
10   10  5!3!l!ir 

The  corresponding  probability  on  the  other  theory  is 
s/4\3    1     1          10! 


/4\s/4\3    1     1 
(l2J  (W  •iri 


The  former  theory  is  therefore  the  better  of  the  two.  If  from  other 
sources  we  knew  however  that  the  bag  did  contain  balls  of  six  colours, 
then  the  first  theory  would  be  ruled  out,  and  the  second  theory  would 
have  to  be  compared  with  other  six-colour  theories.  The  reader  might, 
for  example,  compare  it  with  the  theory  that  the  bag  contains  ^ths 
black,  T\ths  white,  ^ths  red  and  ^th  each  of  green,  blue  and  yellow 
balls. 

Turn  now  to  our  actual  experiments  on  bisecting  a  line.  The  29 
experiments  are  like  29  dips  into  a  bag,  resulting  in  the  numbers  on 
p.  15  being  drawn.  From  our  general  consideration  of  the  problem 
we  believe  that  the  numbers  in  the  bag  form  a  continuum,  that  is  we 
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think  that  another  bisection  mark  might  fall  anywhere  within  the  range, 
and  not  only  at  points  already  struck,  though  of  course  our  measurements 
are  only  being  made  to  the  nearest  tenth-millimetre.  Secondly  we  think 
that  a  curve  of  the  form 


will  express  this  continuum.  The  problem  is,  which  curve  of  this  form 
is  the  best  fitting  one.  This  is  decided  in  exactly  the  same  way  as  was 
the  above  example  of  the  coloured  balls.  For  each  curve  find  the 
probability  that  the  actually  observed  distribution  may  have  arisen 
from  it  by  random  sampling.  Then  that  curve  is  the  best  for  which  this 
probability  is  greatest. 

The  actual  process  of  trial  and  error  giving  now  this  now  that  value 
to  a  and  a  would  take  too  long,  and  we  find  the  optimum  values  by  the 
usual  process  of  the  differential  calculus.  Let  us  do  so  in  a  quite  general 
manner,  taking  n  values  (instead  of  29  as  here). 

Given  a  distribution  following  the  law 


what  is  the  probability  Q  that  the  n  special  values 

xl>  X2>  X3>  x&  •••  x*>  •••  xn 

will  be  obtained  in  a  set  of  n  trials  (the  order  being  immaterial)  ?   The 

probability  of  ZA  is  •,„ 

dx 


and  the  required  probability  is  equal  to  the  continued  product  of  a 
number  of  such  expressions,  in  which  A  takes  the  values  1  to  n  succes- 
sively, multiplied  by  n  !  because  the  order  is  immaterial. 
We  therefore  have 


This  probability  we  wish  to  make  a  maximum  by  choosing  the  best 
values  of  cr  and  a.  We  must  therefore  put 

dQ/da  =  0 
and  dQ/da  =  0. 

Now         O  = 
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and  since  this  equals  zero  we  must  have 

S  (»A  -  a)  =  0, 
/.  S  fa)  =  S  (a)  =  na, 

a  =  S  (xj/n. 

That  is,  a  must  be  the  mean  of  the  observations.   Turning  now  to  the 
second  equation  we  have  (omitting  from  the  first  the  factor 
which  is  independent  of  a) 


Therefore  no2  =  S  (#A  —  a)2, 

<r2  =  S  (icA  -  a)*/n, 
i.e.  a  is  the  standard  deviation  of  the  readings. 

We  find  therefore  that  to  obtain  the  best  fitting  curve  we  must 
find  a  the  value  of  the  mean  of  the  observations,  and  a  the  value  of  the 
standard  deviation.  In  the  case  in  question 

a  =  60-13, 
a=  1-38, 
29 


so  that 


1 


e-(*-60-13)2/(2xl-382) 


where  a  is  in  mms.  This  curve  is  drawn  on  the  adjoining  figure.  Calcula- 
tions, such  as  those  required  to  find  a  number  of  ordinates  of  the  above 
curve  for  the  purpose  of  drawing  it,  are  best  performed  in  tabular 
fashion.  For  example,  the  present  calculation  might  be  arranged  as 
follows,  and  the  reader  should  calculate  one  or  two  ordinates  for  practice 
by  this  means. 


(«) 

(6) 

(c) 

(*) 

w 

</) 

x  in  mms. 

x  -60-13 

2 

2  x  1-382 

Column  c 
x  loge 

Reciprocal  of 
Antilog.  column  d 

r^lnmn  « 

This  arrangement  is  suitable  for  an  approximate  calculation  using 
a  ten  inch  slide  rule,  from  which  the  logarithms  are  also  taken.    The 
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accuracy  thus  attained  is  quite  as  great  as  the  extent  of  the  experiment 
justifies.  If  logarithm  tables,  or  calculating  machines,  were  to  be  used, 
somewhat  different  tabular  arrangements  would  be  required. 

Some  of  the  calculation  in  the  above  table  has,  however,  once  and 
for  all  been  done  and  printed  in  tables  of  the  probability  curve.  The 
best  of  these  for  our  purpose  is  Sheppard's  table,  printed  as  Table  II 
n  Professor  Pearson's  Tables  for  Statisticians  and  Biometricians. 


56 


57 


58 


59 


62 


63 


64 


60          61 
Millimetres 

Fig.  6.  A  normal  curve  fitted  to  the  bisection  data. 
The  histogram  is  also  shown 

Fitting  a  Normal  Curve  to  the  Bisection  Data  with  the 
Aid  of  Sheppard's  Tables* 


I 

II 

III 

IV 

V 

x  in  mms. 

a;'  =s-  60-13 

Sheppard's  x 
•71*88 

Sheppard's  z 

y=29z/l-38 

60-13 

0 

0 

•3989432 

8-38 

60 

0-13 

0-09 

•397 

8-34 

59-5 

0-63 

0-46 

•359 

7-54 

59 

1-13 

0-82 

•285 

5-99 

58-5 

1-63 

1-18 

•199 

4-18 

58 

2-13 

1-54 

•122 

2-56 

57 

3-13 

2-27 

•030 

0-63 

56 

4-13 

2-99 

•005 

0-11 

The  other  half  of  the  curve  can  be  drawn  by  symmetry.  No  interpolation  is  here  used  in 
the  tables.  A  slide  rule,  or  Crelle's  Calculating  Tables,  can  be  used  for  the  multiplications. 

*  Of  course  an  experiment  of  only  29  observations  does  not  justify  any  curve  fitting 
at  all,  as  the  accuracy  is  not  sufficient,  the  sample  not  being  large  enough.  But  a  short 
example  is  necessary  for  explanatory  purposes  in  a  text-book. 
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We  are  at  present  only  concerned  with  Sheppard's  first  and  fifth 
columns  headed  x  and  z  respectively.    These  are  connected  by  the 

relationship  i 

1 


that  is  they  give  a  curve  with  N  =  1  and  a  =  1.  The  x  of  Sheppard's 
table  is,  therefore,  obtained  from  our  x  by  division  by  o-,  and  his  z  needs 
to  be  multiplied  by  N/a.  Our  calculation  then  takes  the  form  shown 
above  Fig.  6. 

It  must  be  clearly  understood  that  there  are  two  parts  in  any  curve 
fitting  problem.  Firstly  there  is  the  decision  as  to  what  kind  of  curve 
is  to  be  used,  and  secondly  the  finding  of  the  best  fitting  curve  of  this 
kind.  If  it  were  merely  a  question  of  getting  a  curve  to  fit  the  particular 
data  of  one  problem,  then  a  curve  could  always  be  drawn  to  fulfil  the 
conditions  exactly.  It  is  only  because  general  considerations  dictate  that 
the  curve  shall  be  of  a  certain  kind,  that  an  exact  fit  cannot  be  obtained 
and  the  best  fit  has  to  be  found. 

(8)  THE  METHOD  OF  LEAST  SQUARES 

The  principles  adopted  in  the  last  section  are  those  which  underlie 
the  Method  of  Least  Squares,  which  is  employed  to  find  the  best 
solutions  of  a  set  of  linear  equations  which  are  more  numerous  than 
the  unknowns,  and  slightly  inconsistent  with  one  another.  To  illustrate 
the  principle  take  three  equations  for  two  unknowns, 

ax  -f  by  +  c  =  0, 
a'x  +  b'y  +  cf  =  0, 
a"x  +  i»y  +  c"  =  o. 

The  quantities  a,  a',  a",  b,  etc.  having  been  measured,  the  equations 
are  found  not  to  come  to  zero  for  any  values  of  x  and  y,  but  to  give 
"residuals"  v,  v',  and  v"  thus: 

ax  -{-by  +  c  =  v, 
a'x  +  b'y  4-  c'  =  v', 
a"x  +  b"y  +  c"  =  v". 

Now  the  presence  of  these  residuals  v  may  be  assumed  to  be  due  to 
numerous  small  errors  in  the  coefficients  a,  b,  c,  and  the  distribution  of 
any  v,  were  one  of  these  equation-  observations  to  be  repeated  many 
times,  will,  it  may  be  assumed,  be  a  Normal  Curve.  The  probability 
of  occurrence  of  v  will  therefore  contain  as  chief  factor  e~v*  and  the 
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probability  of  occurrence  of  v,  v'  and  v"  (presuming  them  independent) 
will  contain  the  factor 

e-*  x  e-*"  x  e-«"\   or   e'8™, 
where  S  (v2)  =  v2  +  v'2  +  v"2. 

To  make  this  probability  a  maximum  we  must  make  S  (v2)  a  minimum, 
whence  the  name  Least  Squares.  The  conditions  for  S  (v2)  a  minimum 
are 

dS/dx  =  'dS/dy  =  0, 

and  also  (though  in  practice  it  is  not  necessary  to  find  them)  the  second 
differentials  must  be  negative.  We  get  at  once 

j-  {(«*  +  by  +  c)2  +  (a'x  +  b'y  +  c')2  +  (a"x  +  b"y  +  c")2} 
=  2  (a2  +  a'2  +  a"2)  x  +  2  (ab  +  a'V  +  a"6")  y  +  2  (ac  +  o'c'  +  a"c")  -  0, 

and  similarly  for  y. 

The  two  equations  thus  reached  are  called  Normal  Equations.  There 
will  of  course  always  be  as  many  of  them  as  there  are  unknowns.  If  the 
original  equations  were  not  of  equal  reliability  or  weight  they  must  of 
course  first  be  multiplied  by  their  weights.  The  rule  of  Least  Squares 
is  therefore:  To  obtain  the  Normal  Equation  for  any  unknown  x,  multiply 
each  equation  by  its  weight  w,  and  by  the  coefficient  of  x  in  that  equation,  and 
then  add  all  the  equations  together.  The  Normal  equations  in  our  example 
are  thus: 

S  (azw)  x  +  S  (abw)  y  +  S  (acw)  =  0, 

S  (abw)  x  +  S  (b2w)  y  +  S  (bcw)  =  0, 
whence  unique  values  of  x  and  y  can  be  found. 


CHAPTER  III 

THE  PSYCHOPHYSICAL  METHODS 

Experimental  methods  and  mathematical  processes — The  method  of  limits — The 
method  of  average  error — The  constant  method — Difference  thresholds  and  the 
probability  of  a  judgment  of  a  certain  category. 

(1)  EXPERIMENTAL  METHODS  AND  MATHEMATICAL  PROCESSES 

THE  experimental  determination  of  absolute  and  difference  thresholds 
or  limina  is  complicated  and  difficult.  A  considerable  number  of  physical 
and  psychological,  and,  it  may  be  added,  mathematical,  factors  is 
involved,  of  varying  relative  importance  in  different  cases.  The  result 
is  that  different  methods  of  procedure  have  been  found  most  suitable 
for  different  cases.  These  methods  have  been  traditionally  grouped 
under  three  (or  four)  distinct  headings,  and  called  the  Psychophysical 
Methods.  They  are: 

(1)  the  Method  of  Limits  (Method  of  Minimal  Changes), 

(2)  the  Method  of  Average  Error  (Method  of  Production), 

(3)  the  Constant  Method  (Method  of  Right  and  Wrong  Cases). 
A  fourth  method  is  generally  added  to  the  list,  viz.: 

(4)  the  Method  of  Equal  Appearing  Intervals,  or  Method  of  Mean 
Gradations,  but  this  is  really  no  new  method.  It  owes  its  special  name 
to  the  nature  of  the  task  which  it  fulfils,  viz.  the  determination  of  equal- 
appearing  (uebermerklich)  sense-distances  as  distinguished  from  just 
perceptible  (ebenmerklich)  sense-distances.   The  method  which  it  employs 
falls  under  one  or  other  of  the  first  three  headings. 

There  are  two  things,  essentially  different  from  each  other,  which 
are  commonly  confused  under  this  one  heading  "  psychophysical 
methods,"  namely  the  methods  of  experimenting  in  order  to  obtain 
data,  and  the  processes  of  calculation  after  the  data  have  been  collected. 
To  avoid  this  confusion  the  words  "method"  and  "process"  will  be 
employed  throughout  this  book  in  the  way  indicated  by  their  use  in 
the  above  sentence.  It  is  urged  that  their  general  adoption  would  be 
advantageous.  The  cause  of  the  confusion  is  to  be  found  in  the  historical 
development  of  the  subject,  for  with  each  of  the  methods  of  experi- 
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menting  a  process  of  calculation  was  associated  and  the  one  name  was 
given  to  both. 

The  experimental  methods  of  determining  thresholds  may  be 
divided  into  two  main  groups: 

1.  Methods  in  which  the  stimulus  is  altered  continuously  until  in  the 
opinion  of  the  subject  it  fulfils  some  given  condition. 

2.  Methods  in  which  various  values  of  the  stimulus  are  separately 
submitted  by  the  experimenter  to  the  subject  who  expresses  a 
judgment  on  each  of  them,  classifying  them  into  two  or  more 
categories. 

The  methods  belonging  to  the  second  group  may  in  turn  be  classified 
according  to  the  order  in  which  the  stimuli  are  submitted  to  the  subject. 
The  second  group  is  thus  subdivided  as  follows: 

2  (a).  Methods  in  which  the  order  of  succession  of  the  stimuli  is 
irregular,  non-consecutive. 

2  (6).  Methods  in  which  the  order  of  succession  of  the  stimuli  is 
consecutively  (i)  ascending  or  (ii)  descending. 

Under  2  (a)  comes  the  Method  of  Right  and  Wrong  Cases.  Under 
2  (6)  come  the  Method  of  Minimal  Changes  and  the  Method  of  Serial 
Groups*.  The  latter  is  a  special  case  of  the  Method  of  Minimal  Changes, 
in  which  each  value  of  the  stimulus  is  submitted  a  number  of  times 
before  the  next  consecutive  value  is  submitted.  A  corresponding  method 
under  2  (a)  is  the  Method  of  Non-Consecutive  Groups,  in  which  groups 
are  taken  in  an  irregular  order.  It  is  customary  in  the  Method  of  Serial 
Groups  to  introduce  among  the  stimuli  an  equal  number  of  "  catch  "  cases, 
in  which  no  stimulus  (or  stimulus  difference)  at  all  is  given.  This  might 
also  be  done  in  other  methods.  Each  method  can  be  further  subdivided 
according  as  the  subject  is  or  is  not  warned  beforehand  of  the  kind  of 
succession  to  expect. 

The  differences  which  result  from  the  use  of  these  various  "methods" 
are  clearly  due  to  their  different  psychological  effects  upon  the  subject. 
But  we  have  also  to  take  into  consideration  the  differences  in  the 
"processes"  of  calculation.  These,  of  course,  are  purely  mathematical 
and  ignore  the  subject  altogether. 

*  We  adopt  here  the  convenient  name  suggested  by  G.  M.  Stratton  who  first  described 
the  method  (Psychol.  Rev.  1902,  ix.  pp.  444 — 447).  It  had,  however,  been  independently 
discovered  and  employed  by  W.  McDougall  four  years  previously  during  his  stay  in 
Murray  Island  (Rep.  Cambridge  Anthrop.  Expedition  to  Torres  Straits,  Cambridge,  1903, 
n.  pp.  190—193). 
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(2)  THE  METHOD  OF  LIMITS 

In  using  this  method  for  the  determination  of  difference  limina  the 
following  mode  of  procedure  is  adopted.  The  variable  stimulus  V  is 
first  made  equal  to  or  slightly  larger  than  the  standard  S,  and  then 
increased  step  by  step  by  small  increments  until  the  subject  finds  it 
just  perceptibly  greater  than  S.  V  is  increased  still  more,  and  then 
gradually  diminished  until  it  just  ceases  to  appear  greater  than  S.  The 
mean  of  the  two  values  of  V — S  thus  obtained  is  the  upper  difference 
limen,  Tv. 

Four  values,  instead  of  two,  might  also  be  obtained,  viz.  for  (i)  V  just 
not  perceptibly  greater  than  S,  (ii)  V  just  perceptibly  greater  than  S, 
(iii)  V  just  perceptibly  greater  than  $,  (iv)  V  just  not  perceptibly  greater 
than  S;  (i)  and  (ii)  being  for  ascending  values  of  F,  (iii)  and  (iv)  for 
descending.  Tu  will  be  the  mean  of  these  four  values  of  V — S. 

The  lower  difference  limen,  Tly  is  obtained  in  a  similar  way.  Both 
limina  may  be  obtained  in  the  same  series  of  experiments  by  the 
"method  of  complete  descents  and  ascents." 

A  series  of  determinations  of  each  limen  is  made  and  the  average 
taken.  A  measure  of  scatter  (mean  variation,  say,  or  standard  deviation) 
is  also  calculated.  Absolute  thresholds  may  also  be  found  by  this  method. 

The  number  and  size  of  the  increments  employed  must  be  adjusted 
to  the  particular  conditions  of  the  experiment.  The  subject  of  the 
experiment  should  be  given  a  certain  amount  of  preliminary  practice 
before  being  started  upon  the  work,  and  introspective  reports  should  be 
asked  of  him. 

Among  the  various  possible  sources  of  error  which  deflect  the 
subject's  judgment  both  in  this  and  in  the  other  psychophysical  methods, 
two  are  of  special  importance.  They  arise  from  the  temporal  and  spatial 
arrangement  of  the  compared  stimuli,  and  are  called  the  "time  error" 
and  "space  error"  respectively.  Thus,  in  a  determination  of  the 
difference  limen  for  sound-intensities,  the  two  stimuli,  S  and  V,  cannot 
be  presented  simultaneously.  One  must  precede  the  other,  and  in  this 
way  it  may  produce  a  slight  degree  of  fatigue  which  causes  an  over- 
estimation  of  the  other,  or  it  may  produce  the  reverse  effect  of  sharpening 
the  attention  to  the  second.  Thus  a  time  error  arises.  Again,  in  experi- 
ments with  brightness-intensities  or  visual  extents,  where  the  stimuli 
can  be  presented  simultaneously,  a  space  error  arises  from  the  fact  that 
the  one  stimulus  must  be  presented  either  to  the  right  or  to  the  left  of 
the  other  stimulus,  and  the  subject's  judgment  varies  accordingly.  In 
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experiments  with  lifted  weights  both  sources  of  error  may  be  involved. 
These  so-called  constant  errors  may  be  approximately  neutralised  by 
arranging  that  in  the  course  of  the  experiment  the  standard  shall  precede 
the  variable  or  stand  to  the  right  of  the  variable  in  half  the  cases  and 
follow  or  stand  to  the  left  in  the  other  half  —  the  time  or  space  order, 
or  both,  being  of  course  changed  quite  at  random  in  the  successive 
limen  determinations.  A  better  plan  is  to  evaluate  the  limen,  and  its 
scatter,  for  each  time  and  space  order  separately.  This  gives  us  the 
values  of  the  time  error  and  the  space  error,  which  are  of  interest  for 
their  own  sakes.  Fechner's  theory  of  these  errors  and  their  measurement 
is  based  upon  the  assumption  that  the  time  and  space  orders  of  the  two 
stimuli,  S  and  V,  exert  an  influence  upon  the  result  which  is  equivalent 
to  a  definite  increase  or  diminution  in  the  stimulus-value  of  one  or  the 
other,  and  thus  increase  or  diminish  the  value  of  S  ~  V  by  the  amount 
of  the  time  error  el9  or  that  of  the  space  error  e2,  or  by  the  sum  or  the 
difference  of  these  two.  The  time  error  is  positive  or  negative  according 
as  the  effect  of  the  time  order  is  to  increase  or  diminish  the  apparent 
value  of  the  first-presented  stimulus.  The  space  error  is  positive  or 
negative  according  as  the  effect  of  the  space  order  is  to  increase  or 
diminish  the  apparent  value  of  the  left-hand  stimulus. 

Four  principal  cases  of  time  and  space  order  are  possible,  and  are 
conventionally  numbered  as  follows*: 

I  Standard  presented  first      and  to  the  right, 
II         „  „        second 

III  „  „        first  „  left, 

IV  „  „        second 

Employing  these  numbers  as  suffixes,  we  have  the  equations 
TU1  =  Tu-€l  +  e^          Th  ^Tl  +  el-  e2, 
TUll  =Tu  +  ei  +  e2,          Thl  =  Tl-e1-  e2> 
TUlll  =  Tu-ei-e2>          Tlm=  Tl  +  ei  +  e2> 
TUiv  =--Tu  +  ei~  e^          Thv=  T,  -  6t  +  e2, 

whence 

T     j_  T  T      -i-  T  T     -4-  T      -4-  T       -t-  T 

T  l  IV  _      u"          "in  ^_  ±u\  "T      un  ^  •*•  "in  ^  ^"iv 

22  4 

and  a  similar  expression  holds  for  Tt;  and 

o  -     _   rp  rp        _   rn          _   rr\         _   rr\       _   rr\       _   rrt          _   rji 

^61  —  •*•  un  ~  •*•  «!  -  •*•  «IV        •*  um       •*  I,        Lh\~   *  hn        L 


\ 


*  G.  E.  MiiUer,  Die  Oesichtsjmnkte.  und  die  Tatsachen  der  psychophysischen  Methodik, 
1904,  pp.  67,  71. 

B.  &  T.  4 
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The  errors  due  to  expectation,  habituation,  fatigue,  etc.,  are  neu- 
tralised or  at  least  reduced  to  a  minimum  by  determining  the  limen  by 
means  of  both  ascending  and  descending  values  of  V  and  averaging,  and 
by  other  special  precautions  in  applying  the  method. 

The  Mathematics  of  the  Method  of  Limits 

It  is  to  Professor  F.  M.  Urban  that  we  owe  the  most  complete  dis- 
cussion of  the  mathematical  foundations  of  this  method*. 
Let  the  variable  stimulus  V  have  the  values 

sl>  52,  53>  •••  Sn' 

Then  for  constant  experimental  conditions  there  will  be,  Prof.  Urban 
assumes,  for  each  stimulus  a  probability  that  a  certain  judgment  (say 
Greater)  will  be  given.  Let  these  probabilities  be 

fefi»ffe."fftp 

If  the  stimuli  were  arranged  in  order  of  magnitude  beginning  with  the 
smallest,  then  these  probabilities  will  also  be  so  arranged.  Let  us 
further  write  q  for  the  probability  that  this  judgment  will  not  be  given. 
Then  for  each  q  and  p  we  have 

l-p  =  q. 

Consider  ascents  first.  A  stimulus  s  is  noted  as  a  just  perceptible 
point  if  the  answer  greater  is  given  at  s  and  was  not  given  at  any  lower 
point  in  that  series.  This  is  a  compound  event,  and  its  probability  is 
the  product  of  a  number  of  g's  for  the  lower  stimuli  where  greater  was  not 
the  answer,  and  one  p  for  the  stimulus  s  itself  where  the  answer  greater 
is  returned.  Let  P  represent  the  probability  that  s  will  be  noted  as  one 
reading  of  the  just  perceptible  point,  then  we  have  the  set  of  equations 

^i  -  Pi. 


and  the  mean  of  the  just  perceptible  points  will  be 

T  =  ^Pi  +  %P2  +  53P3  +  ...  +  snPn  =  S  (sP). 
The  standard  deviation  will  be  given  by 

(T2  =  S  {(s  -  T)z  P}  =  S  (s*P)  -  T2. 

*  See  "Die  psychophysischen  Masamethoden,"  Archivf.  d.  ges.  Psychologic,  1909,  xv. 
p.  289;  "On  the  Method  of  Just  Perceptible  Differences,"  PsychoL  Rev.  1907,  xrv.  p.  244; 
The  Application  of  Statistical  Methods  to  the  Problems  of  Psychophysics,  Philadelphia,  1908. 
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An  example  will  make  the  whole  of  this  much  clearer.  Weights  of 
84,  88,  92,  96,  100,  104  and  108  grams  were  compared,  by  lifting,  with 
a  standard  weight  of  100  grams,  and  the  replies  given,  which  referred 
to  the  variable  weight,  were  heavier,  equal  and  lighter.  They  were 
recorded  as  follows : 


84        88        92        96 


100        104 


108 


e 

1 

h 

h 

e 

h 

h 

1 

h 

e 

h 

h 

h 

b 

1 

1 

1 

h 

h 

h 

h 

1 

1 

1 

e 

e 

h 

h 

1 

1 

e 

h 

h 

h 

h 

and  so  on  for  400  rows,  the  letters  h,  e  and  1  being  the  initial  letters  of 
the  answers  given.  The  five  rows  shown  in  extenso  give  five  readings  of 
the  just  perceptibly  heavier  point,  viz.  92,  88,  96,  104  and  96  grams.  The 
400  of  these  obtained  from  the  400  rows  were  distributed  as  follows 
(Urban's  Subject  I): 

Grams  84        88        92        96        100        104        108 

Frequency  0          7        30        76        106        169          12        400  in  all. 

The  mean  of  these  points  is  100-36,  which  is  therefore  the  directly 
observed  threshold  of  just  perceptible  positive  difference.  (It  should  be 
noted  that  a  time  error  is  included  which  accounts  for  the  nearness  of 
this  point  to  the  standard.)  The  standard  deviation  is  4-35  grams. 

The  frequency  with  which  the  answer  heavier  was  given  at  each 
stimulus  can  also  be  found  from  the  records.  The  application  of  Urban's 
Formula,  based  on  these  frequencies  p,  is  then  as  follows : 


Grams 

1 

P 

<l 

q  products 

P 

Ps 

Pa2 

84 
88 
92 
96 
100 
104 
108 
112 

-4 
-3 
-2 
-1 
0 
1 
2 
3 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•9978 
•9800 
•9111 
•7778 
•5867 
•1044 
•0600 

•9978 
•9778 
•8909 
•6929 
•4065 
•0424 
•0025 

•0022 
•0200 
•0869 
•1980 
•2864 
•3641 
•0399 
•0025 

-  -0088 
-•0600 
-  -1738 
-•1980 

•3641 
•0798 
•0075 

•0352 
•1800 
•3476 
•1980 

•3641 
•1596 
•0225 

sums    4-0108 
grams  per  working  unit            4 

1-0000 

•4514 
-•4406 

1-3070 
•0001  =T* 

Origin* 

16-0432 
84 

T  =  -Q10S 
working  unit         4 

l-3069=<r2 
M4  =  (T 
4  working  units 

4-56  grams  =<r 

100-0432 

origin 

4 

•0432 
100 

100-0432 

^^^mm^mmmm 

*  See  explanation  in  text  to  follow. 
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The  calculated  point  of  just  perceptible  positive  difference  is  therefore 
100-04  grams  and  the  standard  deviation  4-56  grams.  It  will  be  seen 
that  a  working  origin  has  been  taken  at  100  grams,  and  a  working  unit 
equal  to  4  grams,  to  simplify  the  arithmetic.  The  column  headed 
q  products  is  the  continued  product  of  the  q's  from  the  84  end.  The 
P  column  is  most  quickly  formed  as  the  differences  of  successive 
numbers  of  the  product  column.  The  P's  rise  to  a  maximum  and  then 
sink  again.  They  represent  in  fact  the  cocked  hat  distribution  of  the 
just  perceptibly  heavier  points,  and  should  be  compared  with  the  actual 
distribution  of  the  latter.  The  values  P  really  give  the  distribution  which 
would  be  found  were  an  infinite  number  of  ascents  to  be  made,  the 
probabilities  of  the  answer  heavier  at  the  different  points  remaining  through- 
out constant  at  the  actual  frequencies  which  are  found  in  these  400  ascents. 

It  will  be  noticed  that  the  P's  do  not  add  up  to  unity  unless  the 
amount  0-0025  is  included.  This  quantity  gives  the  probability  that 
an  ascent  should  be  made  without  obtaining  any  answer  heavier.  In  the 
example  these  "just  perceptibly  greater"  points  are  centred  at  the 
weight  112,  since  it  is  necessary  to  make  some  assumption  as  to  their 
position.  Strangely  enough,  Professor  Urban  himself  omits  them  in  his 
calculations,  causing  in  some  cases  quite  an  appreciable  error.  This 
difficulty  about  the  "tail"  of  the  P  distribution  would  not  have  been 
necessary  had  the  experiments  been  extended  to  a  point  where  all  the 
answers  were  heavier.  The  peculiar  difficulty  about  this  is  however  that 
the  psychological  conditions  would  thereby  be  changed*.  The  "tail" 
difficulty  will  therefore  always  be  present  in  threshold  measurements. 
It  causes  mathematical  troubles  in  all  the  methods  except  only  in  the 
process  of  calculation  used  in  the  Constant  Method,  and  will  be  fre- 
quently referred  to  in  this  and  the  succeeding  chapters. 

If  the  steps  between  the  stimuli  are  equal,  as  here,  a  saving  in 
arithmetic  which  escaped  Professor  Urban's  notice  can  be  effected  by 
using  the  summation  method  described  on  page  19.  The  sum  of  the 
q  products  gives  the  distance  of  the  threshold  from  the  end  stimulus. 
If  the  equal  increments  between  the  stimuli  are  not  unity  but  x,  the 
sum  of  the  q  products  must  first  be  multiplied  by  x.  The  standard 
deviation  can  also  be  obtained  by  a  further  application  of  the  summation 
process;  the  details  are  left  to  the  reader  who  should  consult  the  next 
chapter.  The  consideration  of  this  point  here  might  lead  us  too  far 
from  the  main  argument. 

*  See  inter  alia  the  article  "On  Judgments  of  Like,"  Frank  Angell,  Am.  Journ. 
Psychol.  1907,  xvm.  p.  263. 
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In  the  same  experiment  the  points  of  just  not  perceptible  difference 
were  distributed  as  follows: 


Grams 
Frequency 


84 
0 


88 

8 


92 
36 


96 


100 
200 


104 
35 


108 

22 


400  in  all. 


Of  these  the  mean  is  98-84  grams,  and  the  standard  deviation 
4-02  grams.  The  application  of  Urban's  Formula  in  this  case  gives  the 
following  results: 


Grams 

8 

P 

p  products 

P 

Ps 

Ps* 

80 
84 
88 
92 
96 
100 
104 
108 

-5 
-4 
-3 
-2 
-1 
0 
1 
2 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•0000 
•0001 
•0069 
•0773 
•3480 
•8419 
•9400 

•0000 
•0001 
•0068 
•0704 
•2707 
•4939 
•0981 
•0600 

•0000 
-•0004 
-  -0204 
-•1408 
-  -2707 

•0981 
•1200 

•0000 
•0016 
•0612 
•2816 
•2707 

.0981 
•2400 

working  unit 
origin 

2-2142 
4 

1-0000 

•2181 
-•4323 

•9532 
•0459  =T* 

8-8568 
108 

T=    --2142 
working  unit             4 

•9073=(r2 

•9525=0- 
4  working  unit 

99-1432 

origin 

-  -8568 
100 

3-8100  grams  =<r 

99-1432 

The  mean  threshold  is  therefore 


by  the  direct  method 
by  Urban's  Formula 


( 100-36  +  98-84)/2 =99-60  grams, 
(100-04  +  99- 14)/2  =99-59  grams. 


Professor  Urban's  own  calculated  values  are  slightly  different,  partly 
because  he  neglects  the  tail  of  the  distribution  as  above  explained, 
partly  because  of  an  arithmetical  error. 

The  practical  applications  of  Urban's  Formula  are  few,  though  it  is 
occasionally  useful  in  cases  where  the  direct  observation  of  the  just 
perceptible  points  is  impracticable*.  Its  great  value  lies  in  the  con- 
clusions which  it  enables  us  to  draw.  In  the  first  place  it  shows  clearly 
that  the  order  in  which  the  stimuli  occur  has  no  effect  on  the  value  of 
the  threshold  from  a  mathematical  point  of  view,  for  the  sum  of  the 
quantities  sP  is  independent  of  their  order,  and  the  P's  themselves  are 
equally  independent  thereof.  This  does  not  of  course  mean  that  a  change 
of  order  of  the  stimuli  will  have  no  psychological  effect.  There  is  no 
doubt  that  the  effect  of  a  non-consecutive  series  of  stimuli  will  be  very 

*  See  e.g.  G.  H.  Thomson,  "Changes  in  the  Spatial  Threshold  during  a  Sitting," 
Brit.  Journ.  Psychol.  1914,  vi.  p.  438. 
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different  from  that  of  an  ascending  or  descending  series,  inasmuch  as 
the  latter  will  arouse  feelings  of  expectation  and  the  like.  Professor 
Urban  therefore  recommends  that  except  when  it  is  specially  desired  to 
study  the  results  of  ascents  or  descents  the  stimuli  should  be  arranged 
in  non-consecutive  order.  In  fact  it  will  be  seen  that  the  above  described 
simple  mathematical  process  can  also  be  applied  to  data  collected  by  the 
Method  of  Eight  and  Wrong  Cases.  We  have  here  an  example  of  the 
distinction  between  methods  of  collecting  data  on  the  one  hand  and 
processes  of  calculation  on  the  other.  The  Limiting  Process  of  calcula- 
tion can  be  applied  to  the  Constant  Method  of  collecting  data. 

Professor  Urban  however,  although  recommending  non-consecutive 
stimuli,  does  not  recommend  that  the  data  should  be  collected  in  the 
way  usual  in  the  Constant  Method  if  it  is  intended  from  the  outset  to 
apply  the  Limiting  Process.  On  the  contrary,  instead  of  keeping  the 
stimuli  constant  throughout  the  experiment,  he  advises  a  frequent 
change  of  stimulus-series.  For  he  found,  from  consideration  of  the 
results  of  interpolating  and  of  irregularly  spacing  stimuli,  that  the  final 
values  obtained  by  the  Limiting  Process  are  dependent  upon  the  par- 
ticular stimuli  used,  in  a  mathematical  way.  To  free  results  from  this 
bias  as  far  as  possible,  he  recommends  the  following  procedure.  The 
steps  in  any  one  series  should  be  of  equal  size,  and  the  starting  point 
of  the  steps  should  be  altered  from  series  to  series  until  the  whole  area 
has  been  covered.  For  example,  if  the  spatial  threshold  were  being 
investigated,  the  stimuli  used  might  be* 

First  series     0,  1,  2,  3,  4,  5,  6,  7,  8  centimetres; 
second  series  0-1,  1-1,  2-1,  3-1,  4-1,  5-1,.  6-1,  7-1,  8-1; 
third        „      0-2,  1-2,  2-2,  3-2,  4-2,  5-2,  6-2,  7-2,  8-2; 


tenth       „      0-9,  1-9,  2-9,  3-9,  4-9,  5-9,  6-9,  7-9,  8-9. 

The  series  might  of  course  follow  each  other  in  any  order.  When  this 
is  done,  Professor  Urban  shows  that  the  final  limen  arrived  at  is  the 
point  where  the  probability  of  the  response  in  question  is  one-half. 
That  is,  the  Limiting  Method  finds  the  median  thresholdf . 

A  modification  of  the  Method  of  Limits,  which  effects  a  more  com- 
plete elimination  of  the  expectation  error  by  the  use  of  "catch  experi- 
ments," where  the  variable  given  is  equal  to  the  standard,  and  which 

*  See  G.  H.  Thomson,  "Changes  in  the  Spatial  Threshold  during  a  Sitting,"  Brit. 
Journ.  Psychol.  1914,  vi.  pp.  435 — 6,  where  this  plan  is  followed. 

t  For  other  discussions  of  the  mathematics  of  the  Limiting  Process  and  allied  subjects 
the  reader  may  consult  articles  by  Wirth,  Urban,  Lipps  in  the  Archivf.  d.  ges.  Psychologic, 
etc. 
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also  collects  data  at  a  quicker  rate  than  the  ordinary  Method  of  Limits, 
is  that  known  as  the  Method  of  Serial  Groups*. 

Each  fixed  value  of  the  variable  stimulus  is  presented  with  the 
standard  ten  times,  not  in  immediate  succession,  but  interspersed  at 
random  among  ten  other  values  of  V  equal  to  S.  The  percentage  of 
correct  answers  given  by  the  subject  is  noted,  and  the  experimenter 
passes  on  to  the  next  value  of  F  which  is  presented  along  with  catch 
stimuli  in  a  similar  manner.  The  value  of  V  which,  as  presented  in  this 
way,  gives  80  %  right  answers,  is  arbitrarily  chosen  as  measuring  the 
limen.  Of  course  this  choice  of  80  %  makes  the  method  measure  a 
totally  different  point  from  the  50  %  point  measured  by  the  Method  of 
Limits,  but  this  is  not  essential  to  the  method  as  a  method  of  collecting 
dataf.  As  such  it  is  a  very  convenient  one  to  use  in  measuring  a  large 
number  of  subjects  in  "mental  test"  experiments,  or  with  primitive 
people,  where  economisation  of  time  is  essential. 

The  mathematical  theory  of  this  method  is  similar  to  that  of  the 
Method  o'f  Limits J.  It  has  been  shown  that  mathematically  the  Method 
of  Limits  is  superior,  and  that  the  mathematical  disadvantages  increase 
with  the  size  of  group  taken.  For  this  reason  groups  of  four  have  been 
suggested  instead  of  ten  §.  Further,  the  arbitrary  use  of  the  80  %  point 
obscures  comparison  with  the  results  of  other  methods.  The  50  %  point 
would  be  better  although  it  does  not  allow  so  large  a  number  of  subjects 
to  be  measured  in  a  short  time  as  does  the  80  %  point,  if,  as  is  assumed, 
each  descent  is  stopped  as  soon  as  the  required  point  is  reached,  and 
a  new  descent  begun.  With*  groups  of  four,  the  75  %,  50  %  and 
25  %  points  can  all  be  noted  if  time  permits  of  complete  descents 
and  ascents  (thus  giving  both  the  median  limen  and  a  measure  of 
scatter,  the  interquartile  range),  while  if  time  pressed,  the  75  %  point 
would  be  sufficiently  comparable  with  the  80  %  point  of  previous 
experiments. 

As  in  the  Method  of  Limits,  the  mathematical  foundations  are 
unaltered  if  the  groups  cease  to  be  in  serial  order.  We  thus  arrive  at 

*  See  Text  Book  of  Experimental  Psychology,  C.  S.  Myers,  Cambridge,  1911,  p.  196. 

t  See  G.  M.  Stratton  (Psychol.  Review,  1902,  ix.  pp.  444 — 447),  who  says  "a  detail 
like  this,  as  well  as  the  exact  number  of  experiments  that  may  best  form  a  group,  might 
well  be  considered  as  subject  to  revision  in  the  light  of  further  experience  and  not  as  an 
essential  part  of  the  method." 

J  G.  H.  Thomson,  "A  Comparison  of  Psychophysical  Methods,"  Brit.  Journ.  Psychol. 

1912,  v.  p.  212. 

§  "An  Inquiry  into  the  Best  Form  of  the  Method  of  Serial  Groups,"  Brit.  Journ. 
Psychol  1913,  v.  pp.  398 — 416;  and  "The  Probable  Error  of  Urban's  Formula,"  ibid. 

1913,  vi.  pp.  217—222. 
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a  Method  of  Non-Consecutive  Groups*,  which  has  been  held  by  one 
experimenter  to  be  the  best  method  of  collecting  data. 

It  may  be  pointed  out  in  passing  that  the  Method  of  Groups  is  one 
which  is  naturally  employed  in  other  branches  of  mental  measurement 
as  well  as  in  psychophysics.  For  example,  the  widely  known  Binet  Tests 
are  given  in  groups  beginning  at  a  group  designed  to  suit  a  child  of  very 
young  age,  and  are  proceeded  with  until  a  certain  percentage  of  passes 
is  obtained  at  some  group.  Usually  however  modifications  in  the  form 
of  marks  for  tests  passed  above  the  critical  group  are  admitted.  There 
is  no  doubt  but  that  the  mathematical  foundations  of  many  of  these 
devices  require  examination  in  the  light  of  the  theory  of  probability^. 

It  may  also  be  pointed  out  that  as  methods  of  collecting  data  the 
Method  of  Serial  Groups,  and  that  of  Non-Consecutive  Groups,  especially 
the  latter,  approach  the  principle  of  the  constant  method  yet  to  be 
described,  and  indeed  their  data  can  very  well  be  handled  by  the  pro- 
cesses in  use  in  the  latter  method. 

(3)  THE  METHOD  OF  AVERAGE  ERROR 

In  this  method  the  subject  is  required  to  adjust  a  variable  stimulus 
so  that  it  seems  subjectively  equal  to  a  given  standard  stimulus.  In  this 
it  differs  considerably  from  the  other  methods  in  which  the  experimenter 
does  the  adjustment,  usually  out  of  sight  or  even  before  the  sitting 
starts,  and  the  subject  expresses  an  opinion  on  each  stimulus  but  does 
not  alter  it.  The  alternative  name  of  the  present  method,  viz.  the  Method 
of  Production,  suitably  emphasises  this  important  psychological  point. 

Mathematically  the  method  presents  differences  which  are  mainly 
due  to  the  fact  that  in  this  method  almost  any  value  of  the  variable 
stimulus  can  crop  up,  whereas  in  the  other  methods  the  experimenter 
customarily  keeps  to  certain  steps,  so  that  the  results  are  as  it  were 
heaped  up  at  certain  points. 

The  experiment  is  repeated  a  large  number  of  times — at  least  100 — 
and  the  arithmetical  mean  of  all  the  obtained  stimulus-values  is  calcu- 
lated. The  difference  between  this  mean  value  and  the  standard  stimulus 
is  known  as  the  crude  constant  error  e.  It  may  be  either  positive  or 
negative.  A  measure  of  scatter  is  also  found. 

The  crude  constant  error  may  be  partly  due  to  a  space  error  (the 
time  error  cannot  occur  in  this  method),  partly  to  other  constant 

*  Thomson,  Brit.  Journ.  Psychol.  1912,  v.  p.  205.  See  also  Brit.  Journ.  Psychol.  1914, 
VT.  p.  434,  where  this  method  was  employed. 

t  See  Francis  N.  Maxfield,  "Some  Mathematical  Aspects  of  the  Binet-Simon  Tests," 
Journal  of  Educational  Psychology,  1918,  ix.  pp.  1—12. 
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conditions*.  Let  us  assume  that  the  experiment  is  to  adjust  the  length 
of  a  variable  line  until  it  seems  to  be  equal  in  length  to  a  standard  line. 
In  this  case,  the  standard  should  be  situated  to  the  right  in  one-half 
the  number  of  adjustments,  and  to  the  left  in  the  other  half,  either 
alternately  or  in  haphazard  order.  The  results  are  tabulated  in  two 
columns,  I  standard  to  the  right,  II  standard  to  the  left,  and  the  means 
of  these  two  found  separately.  Half  the  difference  of  these  means  is 
the  space  error,  while  half  their  sum,  less  the  standard,  gives  the  residual 
constant  error. 

In  order  to  give  as  much  definiteness  as  possible  to  the  task  of 
adjustment,  the  variable  should  start  sometimes  shorter  than  the 
standard,  sometimes  (an  equal  number)  longer,  and  the  adjustment  be 
made  by  lengthening  or  shortening  respectively.  Again,  the  requisite 
amount  of  shortening  or  lengthening  should  be  arranged  to  be  different 
on  different  occasions,  but  alternating  with  some  degree  of  regularity. 

The  value  of  the  scatter  obtained  in  this  method  is  from  a  general 
point  of  view  a  more  important  result  than  the  value  of  the  constant 
errors,  since  it  has  often  been  regarded  as  proportional  to  the  value  of 
the  difference  threshold  as  determined  by  the  other  two  psychophysical 
methods.  The  truth  is  that  although  there  is  a  certain  amount  of 
proportionality  between  the  values,  this  proportionality  is  not  complete. 
Under  certain  conditions  the  two  values  vary  in  opposite  directions-)-. 
It  is  hardly  necessary  to  point  out  that  the  scatter  of  the  individual  points 
obtained  in  the  Method  of  Limits  has  but  little  relation  to  that  reached 
by  the  Average  Error  Method.  They  are  not  entirely  unrelated,  however. 

The  closest  correspondence  of  any  is  that  between  the  distribution 
of  the  errors  in  the  present  method  and  that  of  the  judgments  "equal" 
in  the  Constant  Method. 

(4)  THE  CONSTANT  METHOD 

This  is  generally  regarded  as  the  most  satisfactory  of  the  psycho- 
physical  methods.  It  can  be  employed  with  equal  convenience  for  the 
determination  of  absolute  thresholds,  difference  thresholds,  equal- 
appearing  sense-distances,  and  other  measurements  of  psychological 
importance.  The  different  values  of  the  variable  stimulus  to  be  employed 
are  fixed  once  for  all  at  the  beginning  of  the  investigation,  and  are 
presented  to  the  subject  a  large  number  of  times  (say  100  applications 

*  Cf.  E.  B.  Titchener,  "Experimental  Psychology,"  Students  Manual,  n.  p.  74:  "A 
constant  error  is  simply  an  error  whose  conditions  are  constant;  its  amount  may  vary, 
quite  considerably,  from  stage  to  stage  of  a  long  series  of  experiments." 

t  This  question  is  connected  with  the  point  discussed  on  p.  75  et  seq. 
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of  each)  in  irregular  order,  or  in  a  prearranged  order,  unknown  to  the 
subject,  corresponding  to  certain  precautions.  If  an  absolute  threshold 
is  being  determined,  the  variable  is  presented  alone;  if  a  difference 
threshold,  it  is  on  each  occasion  preceded,  accompanied,  or  followed  by 
the  standard.  In  the  latter  case,  the  subject  returns  the  replies  greater, 
uncertain  or  equal,  less,  with  reference  either  to  the  standard,  or  to  the 
variable,  or  to  the  first  presented  stimulus,  or  on  some  other  prearranged 
plan.  The  percentage  of  each  of  these  three  types  of  answers  is  deter- 
mined for  each  value  of  the  variable  used,  and  recorded  (it  is  also 
advisable  to  retain  the  raw  data  in  the  form  of  the  actual  answers  in 
the  order  given). 

As  one  illustration  of  this  method,  we  shall  find  it  convenient  to 
refer  to  a  series  of  results  obtained  by  Riecker  (Zeitschrift  f.  Biologie, 
Bd.  x.)  which  has  already  served  in  the  descriptions  of  the  method 
given  by  Miiller,  Titchener,  and  Myers.  Riecker  obtained  the  following 
results  in  an  investigation  of  the  "spatial  threshold,"  or  threshold  of 
two-point  judgments  of  the  skin  of  the  lower  eyelid: 

a  (distance  between  the  points  of  \ 

the    aesthesiometer    in    Paris  I        0          0'5      1  1-52         3        4        5          6 

lines)*  j 

IWp  (%  of  two-point  judgments)       30         10       14        40      65      80      87      96       100 

15     ^20      4        26       26      15       7        9       T~ 

It  will  be  observed  that,  with  two  exceptions,  the  series  of  per- 
centages follows  a  general  law  of  increase,  the  rate  of  increase  itself 
increasing  at  first  and  then  diminishing.  The  numbers  in  the  lowest 
line  are  formed  by  subtracting  each  percentage  from  the  immediately 
succeeding  one,  and  show  this  uniformity  more  clearly.  The  two  excep- 
tions are  at  s  =  0  and  s  —  5.  At  s  —  0  the  percentage  is  greater  than  at 
the  immediately  succeeding  stimulus.  This  irregularity  is  known  as  an 
inversion  of  the  first  order.  The  cause  of  it  is  doubtless  to  be  looked  for 
in  the  exceptional  way  in  which  the  stimulus  (one  point)  may  have  been 
applied,  the  pressure  and  general  nature  of  the  contact  may  have  been 
different  from  what  they  were  with  two-point  contact,  or  some  mis- 
leading suggestion  may  have  accompanied  these  particular  experiments. 

At  s  =  5,  the  percentage  is  indeed  larger  than  that  for  s  =  4  and 
smaller  than  that  for  s  =  6,  but  reference  to  the  differences  shows  that 
the  increase  from  4  to  5  is  greater  than  the  increase  from  3  to  4,  thus 
breaking  the  general  rule  as  regards  rate  of  increase.  This  irregularity 
is  known  as  an  inversion  of  the  second  order,  and  being  in  the  present 

*  A  Paris  line =2-26  mms. 
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case  slight,  is  probably  to  be  explained  as  due  to  an  insufficiency  in  the 
number  of  applications  of  the  stimulus. 

As  a  second  example,  in  this  case  one  dealing  with  a  difference 
threshold,  we  shall  take  the  data  for  lifted  weights  which  have  already 
been  used  on  p.  51  in  connection  with  the  Method  of  Limits.  The 
standard  weight  was  100  grams  and  was  lifted  before  each  of  the  seven 
comparison  weights.  The  judgments  given  were  lighter  than,  equal  to, 
or  heavier  than  the  standard.  With  Subject  I  the  answers  heavier  were 
distributed  as  follows,  in  450  trials  with  each  weight: 

Comparison  weight  s         84  88  92  96  100  104         108  grams 

Answers  heavier  1  9  40  100          186          403        423 

Proportions  p  -0022       -0200       -0889       -2222       -4133       -8956      -9400 

The  third  row  of  this  table  is  simply  the  second  row  divided  by  450*. 
A  figure  of  the  quantities  p  forms  a  curve  which,  following  Galton,  we 


•75 


a.  -5 


•25 


84 


88 


100        104        108  grams 


92         96 

stimulus  s 

Fig.  7.   Urban's  Subject  1.  p= proportion  of  answers  heavier 

may  describe  as  an  ogive.  It  is  shown  in  Fig.  7,  where  however  no 
attempt  is  made  to  draw  a  smooth  curve  through  the  points,  which  are 
merely  joined  by  straight  lines. 

Here  there  are  no  inversions  of  either  order,  and  the  example  is 
therefore  a  more  convenient  one  for  a  first  explanation  of  the  various 
processes  of  calculation  which  may  be  adopted.  We  shall  for  this  reason 
deal  with  it  first,  returning  later  and  considering  Riecker's  more  irregular 
numbers. 

We  must  commence  by  defining  the  limen,  and  do  so  in  the  first 

*  This  figure  is  correct.  But  (as  stated  on  page  51)  Urban  only  gives  400  values  of 
the  just  perceptibly  heavier  points:  we  are  unaware  of  the  reason  which  led  him  to  discard 
the  other  50. 
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place  as  the  point  corresponding  to  50  %  of  the  judgments  in  question. 
For  example,  in  our  case  we  define  the  limen  for  the  response  heavier 
as  being  the  point  where  the  subject  would  give  half  his  answers  heavier, 
the  other  half  being  lighter,  equal,  or  whatever  other  answer  is  allowed, 
but  not  heavier.  Above  this  point  he  is  more  likely  to  answer  heavier 
than  not,  below  it  he  is  more  likely  not  to  give  this  reply. 

The  limen  corresponding  to  50  %  heavier  judgments  may  be  calcu- 
lated in  two  general  ways:  (A)  from  the  observed  values,  (B)  by  finding 
the  best  fitting  smooth  curve,  adjusting  the  observations  to  this,  and 
then  calculating  the  constants  (mean,  mode,  scatter,  etc.)  from  the  curve. 

A  (1).    Linear  Interpolation 

The  required  limen  obviously  falls  somewhere  between  100  grams 
(41-33  %  answers  heavier)  and  104  grams  (89-56  %  answers  heavier). 
A  very  simple  way  therefore  of  determining  its  value  is  to  assume  these 
points  joined  by  a  straight  line,  as  in  Fig.  7,  and  find  where  this  line 
cuts  the  50  %  line.  Arithmetically  this  means  finding  a  weight  which 
divides  the  interval  from  100  to  104  grams  in  the  same  proportion  as 
50  %  divides  the  interval  between  41-33  %  and  89-56  %.  If  T  be  this 
weight,  we  have  then 

T  -  100      50  -  41-33  • 

104-  T~  89-56-  50' 

and  with  practically  no  calculation  we  obtain  the  value  100-72  grams 
for  the  limen. 

This  method  though  commendably  simple  is  open  to  several  ob- 
jections: 

(1)  It  does  not  employ  all  the  data ;  it  uses  two  of  the  percentages 
only. 

(2)  The  assumption  that  the  curve  is  a  straight  line  at  this  point 
is  unlikely  to  be  exact. 

(3)  It  gives  no  measure  of  scatter. 

A  very  simple  extension  of  the  above  idea  has  been  suggested* 
which  obviates  the  third  and  to  some  extent  the  first  of  these  objections. 
This  is  to  calculate  the  75  %  and  the  25  %  points  by  the  same  simple 
form  of  linear  interpolation  as  that  employed  for  the  limen  itself,  that 
is,  on  the  Fig.  7,  to  find  where  the  zig-zag  ogive  crosses  the  -25  and 
•75  lines.  These  distances  are  readily  found  by  a  short  calculation 
to  be  96-58  and  102-79  grams  respectively.  Half  the  interval  between 

*  G.  H.  Thomson,  "A  Comparison  of  Psychophysical  Methods,"  Brit.  Journ.  Psychol. 
1912,  v.  p.  210  footnote. 
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them,  namely  3-10  grams,  is  the  semi-interquartile  range,  a  rough  but 
very  practical  measure  of  scatter. 

The  fact  that  the  50  %  point  is  not  half-way  between  the  25  %  point 
and  the  75  %  point  is  a  rough  indication  of  the  skewness  of  the  data : 
it  divides  the  interquartile  range  in  the  proportion  4-14  grams  below 
and  2-07  grams  above. 

Moreover,  it  will  be  found  that  in  practice  the  mean  of  the  three 
values,  the  25  %,  50  %,  and  75  %  points,  gives  a  fairly  good  approxi- 
mation to  the  threshold  found  by  more  complicated  calculations  assuming 
a  symmetrical  distribution.  In  the  present  case  this  gives 

96-58  +  100-72  +  102-79 

— -  =  100-03  grams. 

o 

A  (2).    The  Arithmetical  Mean*  (Spearman's  Formula) 

A  value  which  can  be  calculated  by  a  use  of  all  the  data  is  that  of 
the  mean  or  average  limen.  Before  proceeding  to  consider  this,  it  is 
important  to  realise  clearly  the  fact,  which  G.  E.  Miillerf  was  the  first 
to  point  out  and  emphasise,  that  a  limen  is  a  variable  magnitude 
following  a  law  of  frequency-distribution.  There  is  no  fixed  limen,  only 
an  average  limen,  a  most  frequent  limen,  or  a  most  representative  limen. 
The  p's  in  the  table  (p.  59)  represent  the  relative  frequency  of  limina 
for  stimuli  below  the  corresponding  s.  Thus  there  were  41-33  %  limina 
below  100  grams,  and  22-22  %  limina  below  96  grams;  i.e.  there  were 
41-33  —  22-22  or  19-11  %  limina  between  the  limits  of  stimulus-values 
96  grams  and  100  grams.  This  suggests  the  plan  of  plotting  a  frequency- 
polygon  or  histogram  for  the  limina  using  these  differences  of  the  p's. 
For  the  present  case  the  table  of  differences  runs : 

Below  84  grams         1,  or  -0022  of  the  whole 

84—88  8     „  -0178 


88—92 

92—96 

96—100 

100—104 

104—108 

Above  108 


31     „  -0689 

60    „  -1333 

86    „  -1911 

217    „  -4823 

20    „  -0444 

27J  „  -0600 


*  "The  Method  of  Right  and  Wrong  Cases  without  Gauss  s  Formulae,"  Brit.  Journ. 
Psychol  1908,  n.  pp.  227—242. 

t  This  at  least  is  the  view  to  which  Miiller  himself  inclines  (Gesichtspunkte,  p.  59), 
but  he  deduces  his  formulae  on  the  assumption,  supported  by  Fechner  and  Bruns,  that 
the  threshold  has  a  single  definite  value,  subject  in  the  course  of  an  experimental  deter- 
mination to  variable  apparent  increase  or  decrease  by  random  influences  which  obey  the 
Normal  Law  of  error.  He  points  out  that  both  views  lead  to  the  same  formulae.  Indeed 
the  distinction  is  merely  a  verbal  one,  in  our  opinion. 

J  This  is  not  necessarily  an  inversion  of  the  second  order,  for  it  may  be  spread  out 
to  any  distance  above  108  grams. 
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and  the  histogram,  or  pseudo-histogram  *  as  it  is  safer  to  call  it,  is  given 
in  Fig.  8. 

If  we  now  wish  to  proceed  to  the  calculation  of  the  mean  of  these 
limina  we  must  decide  where  to  centre  those  which  lie  between  say 
92  and  96  grams.  It  is  well  known  that  this  centre  must  be  taken  in 
the  actual  geometrical  centre  of  each  intervalf ,  and  Professor  Spearman's 
suggestion  to  use  not  the  mid-ordinate,  but  an  ordinate  shifted  slightly 
towards  the  higher  of  the  two  neighbouring  columns,  is  unnecessary. 
The  unadjusted  value  is  the  best  value  of  the  mean.  The  source  of 
Professor  Spearman's  error  lies  in  the  fact  that  in  his  discussion  he 
replaces  each  rectangle  of  the  pseudo-histogram  by  a  trapezium  of 
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217 


88          92          96         100        104        108  grams 
Fig.  8.   Pseudo-histogram  of  the  450  limina.   Urban's  Subject  I,  answers  heavier 

different  area,  and  then  multiplies  the  centroid  abscissa  of  this  new  area 
by  the  old  area,  which  could  not  possibly  have  the  centroid  of  the  new 
one. 

A  real  difficulty,  and  one  which  robs  the  method  of  much  of  the 
practical  utility  which  it  would  otherwise  have,  is  the  impossibility  of 
fixing  in  any  unambiguous  fashion  the  centres  of  the  tails  of  the  distribution, 
that  is  those  limina  which  in  our  example  lie  below  84  and  above  108 
grams  respectively.  This  is  the  same  trouble  as  that  referred  to  on 
p.  52  in  connection  with  the  application  of  Urban's  Formula. 

*  See  later,  p.  79. 

t  Cf.  e.g.  W.  F.  Sheppard,  "On  the  Calculation  of  the  most  probable  Values  of  Fre- 
quency Constants,"  Proc.  Land.  Math.  Soc.  xxix.  pp.  353  ff. 
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The  simplest  thing  to  do  is  to  assume  that  at  80  grams  the  subject 
would  never,  and  at  112  grams  would  always,  have  answered  heavier, 
i.e.  to  centre  the  tails  at  82  and  at  110  grams  respectively.  We  shall 
do  so  in  this  case,  although  it  is  clear  that  at  the  upper  end  the  tail  is 
probably  more  spread  out  than  this.  We  then  have  the  following  calcu- 
lation for  the  mean  or  average  limen : 

1  at  82=   82 

8  „  86=  688 

31  „  90=  2790 

60  „  94=  5640 

86  „  98=  8428 

217  „  102=22134 

20  „  106=  2120 

27  „  110=  2970 

44852  total 

Dividing  this  by  the  whole  number  of  limina,  450,  we  obtain  99-67  grams 
as  the  threshold  by  this  method.  The  standard  deviation  can  also  be 
found,  cr  =  5-1  grams  (but  see  next  chapter  for  a  correction  to  this 
value,  known  as  Sheppard's  adjustment). 

The  summation  method  of  finding  the  mean,  which  is  explained  on 
p.  19,  again  enables  a  saving  in  arithmetic  to  be  effected  in  this  calcula- 
tion. Applied  directly  to  the  frequencies  p  it  works  as  follows  in  the 
present  case,  and  obviates  the  actual  formation  of  the  pseudo- histogram. 

Frequency  of  answers  heavier  at    84  grams         '0022 

88    „  -0200 

92    „  -0889 

96    „  -2222 

100    „  -4133 

104    „  -8956 

108    „  -9400 

2-5822  sum 

4  gram  units 

10-3288 
This  must  be  subtracted  from  origin     110 

99-6712  grams  threshold. 

The  reader  must  be  referred  to  a  study  of  the  summation  method  on 
p.  19  to  realise  why  in  this  case  the  sum  obtained  has  to  be  subtracted 
from  the  origin. 

We  can  at  this  point  call  attention  to  an  instructive  comparison 
which  can  be  drawn  between  the  Method  of  Limits  and  the  Method  of 
Right  and  Wrong  Cases,  as  regards  the  processes  of  calculation  employed 
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in  them,  on  the  basis  of  Urban's  Formula  for  the  former  and  Spearman's 
Formula*  for  the  latterf. 

Urban's  Formula,  S  (sP),  shows  the  threshold  as  the  mean  of  a 
number  of  just  perceptible  points,  which  are  centred  at  the  stimuli  used, 
the  quantities  P  being  the  differences  of  successive  products  of  the 
frequencies  of  the  answers  heavier  (or  not  heavier). 

Spearman's  Formula,  as  exemplified  in  the  above  calculations,  can 
be  written  S(s.dp),  where  dp  is  the  difference  of  the  successive  fre- 
quencies p,  and  shows  the  threshold  as  the  mean  of  a  number  of  limina 
which  lie  between  the  stimuli  used. 

B.    Methods  which  fit  smooth  curves  to  the  data 

The  idea  underlying  these  methods  is  to  run  a  smooth  curve  through 
the  points  of  Fig.  7  (the  p  values)  instead  of  simply  joining  them  by 
a  zigzag,  and  then  to  ascertain  where  this  smooth  curve,  which  does 
not  necessarily  go  exactly  through  any  of  the  points  but  smooths  off 
any  inequalities,  passes  the  50  %  line.  Any  suitable  curve  which 
happened  to  occur  to  one  might  of  course  be  employed.  For  example, 
a  parabola  of  high  order  can  be  used J,  and  the  curve  tan"1  6  has  also 
been  tried §.  But  clearly  the  whole  experiment  suggests  that  an  error 
function  of  some  sort  is  wanted,  and  as  early  as  1860  G.  T.  Fechner 
suggested  1 1  that  such  numbers  formed  the  integral  of  a  Normal  Curve 
of  Error. 

This  idea  would  naturally  occur  to  anyone  accustomed  to  handling 
the  Normal  Curve  on  considering  the  pseudo-histogram  or  table  of 
differences,  Fig.  8.  The  obvious  skewness  of  the  diagram  would  also 
strike  such  an  observer,  it  may  be  noted  in  passing,  but  with  this  point 
we  shall  deal  in  a  separate  chapter. 

The  obvious  way  to  fit  such  a  histogram  with  a  Normal  Curve  is 
that  given  in  the  previous  chapter,  namely,  to  find  the  mean  and  the 
standard  deviation,  and  use  these  constants  in  the  expression  for  the 
curve.  To  this  there  are  however  important  practical  objections,  the 
chief  being  the  difficulty  of  the  undefined  tails,  to  which  reference  has 

*  We  use  Professor  Spearman's  name  for  this  plan  since  he  was  the  first  English 
writer,  we  believe,  to  call  attention  to  it:  but  we  do  not  include  the  erroneous  point  in 
his  calculation  to  which  reference  has  already  been  made. 

t  G.  H.  Thomson,  "A  Comparison  of  Psychophysical  Methods,"  Brit.  Journ.  Psychol. 
1912,  v.  p.  226. 

%  F.  M.  Urban,  "Die  psychophysischen  Massmethoden  als  Grundlagen  empirischer 
Messungen,"  Archiv  f.  d.  ges.  Psychologie,  1909,  xv.  and  xvi.  pp.  335—356. 

§  Urban,  loc.  tit.  p.  393  et  seq. 

||  G.  T.  Fechner,  Elemente  der  Psychophysik,  1860. 
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already  been  made  in  discussing  Spearman's  plan  of  finding  the  mean. 
This  difficulty  is  still  more  acute  when  we  attempt  to  find  the  standard 
deviation.  This  tail  difficulty  does  not  occur  in  the  plan  adopted  by 
Muller*,  which  fits  the  Normal  Integral  direct  to  the  p  values,  not 
troubling  at  all  about  the  differences  forming  the  histogram.  In  other 
words,  Muller  's  process  fits  a  curve  to  Fig.  7,  not  to  Fig.  8,  and  is 
therefore  more  direct,  in  addition  to  the  advantage  it  has  of  avoiding 
the  tail  problem. 

Before  proceeding  to  the  explanation  of  Muller's  process,  it  is 
necessary  to  notice  that  he  employed  a  slightly  different  form  of  the 
equation  to  a  Normal  Curve  from  that  used  in  the  previous  chapter. 
The  latter  is  the  form  now  in  general  use  among  biometricians,  and  it 
seems  desirable  that  it  should  also  be  used  by  psychometricians,  who 
otherwise  would  be  hindered  from  the  direct  application  to  their  work 
of  the  mathematical  improvements  made  by  the  biometric  school,  and 
especially  would  find  their  use  of  the  valuable  tables  published  by 
Professor  Pearson  much  hampered.  In  the  actual  description  of  Muller's 
work  however  it  is  better  for  the  present  to  keep  to  his  notation  in  this 
respect.  The  form  of  the  Normal  Curve  used  by  him  was 


In  this  s  is  the  variable  stimulus,  T  is  the  average  limen,  and  therefore 
also,  since  the  curve  is  symmetrical,  the  median  and  the  mode.  So  far 
the  notation  agrees  with  that  used  in  the  expression  employed  for  the 
Normal  Curve  in  the  previous  chapter,  namely 


but  a  further  comparison  of  the  two  shows  that  for  his  second  constant 
Muller  has  a  quantity  h,  which  is  connected  with  the  standard  devia- 
tion or  of  the  biometric  formula  by  the  relationship 


.. 

When  the  standard  deviation  is  large,  therefore,  h  is  small.    It  is  a 
measure  of  precision,  not  of  scatter. 

The  assumption  is  now  made  that  the  relationship  between  the 

*  G.  E.  Miiller,  "Ueber  die  Maassbestimmungen  des  Ortssinnes  der  Haut  mittels  der 
Methode  der  richtigen  und  falschen  Falle,"  Pfluger's  Archiv  fur  die  ges.  Physiologic,  1879, 
xix.  pp.  191  —  235,  especially  par.  5  et  seq.:  also  Die  Gesichtspunkte  und  die  Tatsachen 
der  psychophysischen  Methodik,  Wiesbaden,  1904,  par.  11,  where  the  classical  description 
of  this  method  will  be  found. 

B.  &  T.  5 
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stimulus  5,  and  the  frequency  p  with  which  the  answer  heavier  is  returned, 
is  given  by  the  equation 


That  is  to  say,  it  is  assumed  that  the  successively  increasing  percentages 
of  answers  heavier  correspond  to  the  increasing  area  of  the  portion  of 
a  Normal  Curve  which  is  shaded  in  Fig.  9,  as  the  point  s  moves  to 
the  right  in  that  figure. 

To  obtain  this  equation  in  a  more  convenient  form  for  our  purpose, 

wnte  h(s-T)  =  t  ......  (2). 

This  corresponds  to  measuring  the  stimuli  in  a  special  unit,  and  is  the 


Fig.  9.  To  illustrate  Miiller's  process 

same  device  as  that  used  in  another  connection  later  by  Galton.    The 
equation  then  becomes 


-T) 


By  inserting  in  this  equation  the  corresponding  values  of  p  and  s 
from  the  table  of  data  on  p.  59,  we  obtain  seven  equations  for  two 
unknowns  h  and  T,  and  as  these  equations  are  slightly  inconsistent  with 
one  another  we  have  to  decide  how  to  calculate  the  most  probable 
values  of  h  and  T.  No  pair  of  values  will  exactly  satisfy  all  seven 
equations.  Instead  of  coming  to  zero  they  leave  small  residuals  v. 
Muller  adopted  the  Method  of  Least  Squares,  an  account  of  which  will 
be  found  on  p.  44  in  Chapter  II. 

In  passing  a  note  must  be  made  of  the  fact  that  Muller  assumed 
tacitly  that  these  observation  equations,  being  each  based  on  the  same 
number  of  experiments,  are  of  equal  importance  or  "  weight*."  We  shall 
allow  this  assumption  to  pass  for  the  present  but  shall  return  to  it  later. 

Unfortunately,  the  equations  are  very  far  from  being  simple  and 

*  There  is  unfortunately  a  possibility  of  ambiguity  here  owing  to  the  fact  that 
weights  are  used  as  stimuli. 
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linear  as  in  the  example  on  pages  44  and  45.  To  avoid  this  difficulty, 
we  look  up  in  tables  of  the  Probability  Integral*  those  values  of 

Y  =  h(s-T)  (4) 

which  correspond  exactly  to  our  values  of  p. 

These  equations  are  not  yet  linear  in  T  and  h,  though  much  simpler. 
If  however  we  write  c  =  hT 

they  become  y  —  hs  +  c  =  0  (5)f 

and  are  now  linear  in  h  and  c.  If  we  now  insert  any  pair  of  values 
h  and  c  into  these  seven  (or  in  general,  n)  equations,  these  also  will 
leave  residuals  u,  different  from  those  considered  above  in  connection 
with  the  equations  (3).  If  we  were  now  to  proceed  to  make  S  (u2} 
a  minimum,  this  would  not  effect  our  purpose.  It  is  S  (v2)  we  wish  to 
make  a  minimum,  not  S  (u2).  If  however  we  can  find  multipliers  or 
"weights"  M  such  that  each 

M  u2  =  v2, 

then  we  can  make  S  (Mu2) 

a  minimum.  That  is,  we  can  apply  Least  Squares  to  the  equations  (5) 
weighted  with  certain  artificial  weights  (in  addition  to  any  weights 
which  may  possibly  be  necessary  by  reason  of  there  being  different 
numbers  of  experiments  at  the  different  stimulus- values).  The  use  of 
this  device  of  artificial  weights  to  overcome  the  complexity  due  to  the 
non-linear  equations  is  Muller's  particular  credit  in  this  connection. 

Clearly  the  residuals  v,  which  may  be  regarded  as  errors  in  p,  are 
connected  with  the  residuals  u,  which  may  be  regarded  as  errors  in  y, 

by  the  equation  1 

-_  e-v*  u  =  v 

i/W 

from  equations  (3)  and  (4).   Therefore 

M  =  e-^JTT. 

Herein  we  can  omit  the  TT  since  it  is  only  the  relative  values  of  the 
Miiller  weights  which  are  of  importance.  These  weights  are,  by  reason 
of  the  improved  weights  to  be  shortly  described,  of  only  historical 
interest.  The  condition  that  S  (v2)  should  be  a  minimum  has  now 

*  The  table  of  the  Probability  Integral  commonly  used  by  psychologists  is  that  known 
as  Fechner's  Fundamental  Table,  which  is  given  in  Appendix  I.  It  is  desirable  that 
psychologists  should  make  the  slight  changes  in  notation  necessary  to  enable  them  to 
use  better  and  more  generally  accessible  tables.  For  example,  the  first  table  in  Pearson's 
Tables  for  Biometricians  and  Statisticians  is,  except  for  a  factor  \/2,  identical  in  significance 
with  Fechner's,  and  gives  more  values,  to  more  decimal  places. 

f  Note  that  this,  like  (3),  represents  a  set  of  equations,  each  of  this  form.  In  our 
example  there  are  seven  such  equations. 

5—2 
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become  that  S  (Mu2)  should  be  a  minimum.    With  this  substitution, 
the  equations  (5)  give  the  two  Normal  Equations 

S  (Msy)  -  S  (Ms2)  h  +  S  (Ms)  c  =  0\ 
S  (My)  +  S  (Ms)  h-S(M)c^O      j 
Thence  we  have 

S  (Ms)  S  (Msy)  -  S  (My)  S  (Ms2) 
S  (M)  S  (Ms2)  -  S  (Ms)2 


S  (M)  S  (Msy)  -  S  (Ms)  S  (My) 


,.(7). 


S  (M)  S  (Ms2)  -  S  (Ms)2 
T  =  c/h 

The  use  of  these  formulae  is  best  explained  by  an  example.  Before 
giving  one,  however,  it  is  well  to  describe  a  modification  in  the  weights  M 
which  was  introduced  in  1909  by  Professor  F.  M.  Urban*,  as  it  is  with 
Urban's  weights,  and  not  with  Muller's,  that  we  shall  actually  work. 

These  alterations  in  the  Miiller  weights,  or  rather  additions  to 
Muller's  weights,  which  Professor  Urban  made,  arise  from  the  notion 
of  the  probability  of  a  certain  judgment,  with  which  we  are  already 
familiar.  The  analogy  is,  as  it  were,  between  extracting  the  answers 
heavier  or  lighter  from  a  subject,  and  extracting  black  or  white  balls 
from  a  bag  containing  a  mixture  of  these  colours.  Compare,  for  example, 
the  two  statements: 

(1)  From  a  bag  containing  black  balls  and  white  balls,  450  drawings 
are  made,  one  at  a  time,  the  ball  being  returned  each  time  before  the 
next  drawing  is  made.  403  black  balls  are  observed  out  of  the  450. 

(2)  A  subject  on  performing  a  certain  experiment  with  lifted  weights 
sometimes  gives  the  answer  heavier,  sometimes  some  other  answer.    On 
one  occasion,  when  the  weights  were  100  grams  standard  and  104  grams 
unknown,  this  experiment  was  repeated  450  times,  and  the  answer 
heavier  was  obtained  403  times  out  of  the  450. 

Now  if  p  is  the  observed  proportion  (here  403/450)  of  black  balls  in 
a  bag,  then  the  probable  error  of  p  is  known  to  vary  with  Vp  (1  —  p), 
or  Vpq"f.  With  the  same  sized  sample,  a  result  p  =  -5  has  a  larger 
probable  error  than  a  result  p  =  -8  say.  If  anything  similar  holds,  as  the 
analogy  suggests,  for  the  psychometric  experiment,  then  the  seven,  or  w, 

*  "Die  psychophysischen  Massmethoden,"  Archiv  f.  d.  ges.  PsychoL  1909,  xv.  and  xvi. 
p.  357  et  seq. 

t  Cf.  p.  32,  Standard  Deviation  of  the  Binomial  Expansion.  Really  the  true  values 
of  p  and  q  should  be  used,  but  this  is  the  best  we  can  do.  And  further,  the  expression 
probable  error  ceases  to  have  an  accurate  meaning  when  p  is  too  close  to  zero  or  unity 
and  the  distribution  is  in  consequence  very  skew.  But  these  refinements  do  not  affect 
the  argument  except  in  detail. 
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equations  (5)  are  not  equally  reliable,  even  though  based  on  the  same 
number,  450,  of  experiments  each.  In  addition  to  the  Miiller  weights  M 
they  need  other  weights  l/ipq  to  allow  for  this  new  variation  in  reliability. 

These  weights,  it  will  be  observed,  arise  from  the  fact  that  drawing 
balls  singly  from  a  bag  in  this  way  gives  rise  to  a  binomial  distribution, 
and  the  standard  deviation  of  such  a  distribution  is,  as  is  shown  on 
p.  32  in  the  previous  chapter,  equal  to  Vpq.  The  combined  weights 
Mfipq  are  known  as  Urban's  weights,  and  are  also  given  in  a  table  in 
Appendix  I.  Professor  Urban  discusses  the  matter  at  some  length  in 
his  already  cited  article,  and  a  discussion  will  also  be  found  in  Wirth's 
Psychophysik  (Leipzig,  1912),  where  on  p.  151  the  actual  scatter  of 
various  p's  is  given  in  a  diagram. 

The  fact  that  in  the  above  we  have  taken  the  weights  as  proportional 
inversely  to  the  square  of  the  probable  error,  pq,  need  cause  the  reader 
no  trouble,  for  it  is  the  same  phenomenon  as  the  fact  that  the  accuracy 
of  a  set  of  readings  increases  with  the  square  root  of  the  number  of 
readings  made,  as  follows  from  p.  24.  The  weight  of  an  observation 
equation  in  an  ordinary  sense  is  simply  the  number  of  times  the  observa- 
tion has  been  repeated,  that  is  it  is  proportional  to  n  the  number  of 
observations,  which  as  we  have  just  said  is  proportional  inversely  to 
the  square  of  the  probable  error. 

Using  the  symbol  W  for  the  combined  Urban  and  Miiller  weights 
which  are  given  in  tabular  form  in  Appendix  I,  we  have  to  replace  M 
by  W  in  the  equations  (6).  These  equations  we  shall  now  illustrate 
by  giving  at  some  length  the  calculations  indicated  by  them  in  the  case 
of  Urban's  Subject  I,  answers  heavier. 

We  first  look  up  in  Fechner's  Fundamental  Table*,  i.e.  a  table  of  the 
probability  integral,  those  values  of  y  which  correspond  to  the  observed 
values  of  p.  These  are  given  in  the  third  column  of  the  adjoining  table. 
The  fourth  column  of  that  table  gives  the  values  of  the  Urban-Muller 
weights  W,  found  from  the  table  in  the  Appendix. 

Urban's  Subject  I,  Heavier  answers 

Values  of  7  and  W  for  substitution  in  the  normal  equations 

Stimulus  a  grams        p                         y                      W  W» 

84                -0022                 -2-0150               0-025  2-10 

88                -0200                 -1-4520               0-187  16-46 

92                -0889                 -0-9528               0-502  46-18 

96                -2222                 -0-5408               0-806  77-38 

100                -4133                 -0-1549               0-982  98-20 

104                -8956                    0-8888               0-551  57-30 

108                -9400                    1-0993               0-396  42-77 

3-449  340-39 
*  See  Appendix  I. 
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From  this  table  S  ( W)  is  at  once  obtained,  and  the  other  quantities 
appearing  in  equations  (6)  are  easily  formed,  though  the  arithmetic 
is  laborious.  The  work  for  S  (Ws)  is  given  in  the  last  column  of  the  above 
table,  and  the  other  quantities  are  obtained  similarly.  They  prove  to  be 

S  (W)  =         3449, 
S(Ws)  =      340-39, 
S(Wsy)=   -  31-223, 
S  (Wy)  =     -  0-463, 
S(Ws2)  =  33700-1. 

The  equations  (7),  reading  W  instead  of  M,  then  give 
Threshold  T  =  99-68  grams, 
Precision    h  =    0-136113. 
The  standard  deviation  corresponding  to  this  precision  is 

«  =  yO  =  6-2  grams, 

and  -6745(7  =  3-5  grams. 

The  fact  that  99-68,  which  is  thus  found  for  the  threshold  of  answers 
heavier,  is  actually  smaller  than  the  standard  100  grams  than  which  it 
is  judged  heavier,  may  cause  confusion  if  it  is  not  at  once  explained 
that  this  value  involves  a  time  error. 

It  will  be  seen  that  the  Constant  Process  as  described  above  and  as 
illustrated  by  this  example,  involves  a  great  deal  of  arithmetical  work, 
so  much  so  indeed  that  it  is  certain  never  to  be  used  except  in  some 
special  cases,  unless  plans  for  easing  this  labour  be  adopted.  Something 
can  be  done  by  using  the  arithmetical  short-cuts  explained  in  the  pre- 
ceding chapter,  and  Crelle's  Calculating  Tables,  or  better  still  a  calcu- 
lating machine,  makes  the  work  practicable.  But  the  best  device  for 
reducing  the  arithmetical  work  involved  in  the  Constant  Process  is  that 
adopted  by  Professor  Urban  in  publishing  his  tables  for  this  method. 
These  tables  are  given  in  Appendix  I,  and  do  away  with  the  necessity 
for  Fechner's  Fundamental  Table  and  the  Table  of  the  Miiller-Urban 
weights.  They  assume  that  exactly  100  experiments  have  been  made 
at  each  stimulus-value,  but  of  course  if  other  numbers  of  experiments 
have  been  made,  the  p's  can  be  approximated  to  by  two  significant 
figures.  Their  use  will  be  readily  grasped  from  the  worked  example  on 
p.  73  below:  and  that  example  is  also  employed  in  Appendix  I  to 
explain  Eich's  useful  Checking  Table. 
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We  give  next,  as  models,  the  calculations  by  all  the  above  methods 
for  Eiecker's  data,  assuming  however  that  no  two-point  judgments  were 
given  at  zero  stimulus-distance.  The  data  are  given  on  p.  58  and  it 
should  be  noted  that  the  stimuli  are  not  equidistant,  and  that  therefore 
summation  methods  cannot  be  used  to  lessen  the  arithmetical  work. 

Riecker's  Data 

(1)    LIMITING  PROCESS.    By  Urban' s  Formula 
(a)    Just  Perceptibly-two  Points 


8 

P 

9 

q  products 

P 

Ps 

P*2 

0 

•00 

1-00 

1-0000 

•0000 

•0000 

•0000 

0-5 

•10 

•90 

•9000 

•1000 

•0500 

•0250 

1 

•14 

•86 

•7740 

•1260 

•1260 

•1260 

1-5 

•40 

•60 

•4644              -3096 

•4644 

•6966 

2 

•65 

•35 

•1625 

•3019 

•6038 

1-2076 

3 

•80 

•20 

•0325 

•1300 

•3900 

1-1700 

4 

•87 

•13 

•0042 

•0283 

•1132 

•4528 

5 

•96 

•04 

•0002 

•0040 

•0200 

•1000 

6 

1-00 

•00 

•0000 

•0002 

•0012 

•0072 

Sums 

1-0000 

1-7686 

3-7852 

i 

T  =  sum  of  Ps  =  1-7686  Paris  Lines. 

Standard  deviation  of  the  just  perceptibly-two  points,  squared, 
=  sum  of  Ps2,  less  T2 
=  3-7852  -  3-1279  =  -6573, 
whence  standard  deviation  =  -81  Paris  Lines. 

(b)   Just  Imperceptibly-two  Points 


s 

P 

2?  products 

P' 

P'a 

P's* 

0 

•00 

•0000 

•0024 

•0000 

•0000 

0-5 

•10 

•0024 

•0219 

•0109 

•0055 

1 

•14 

•0243 

•1494 

•1494 

•1494 

1-5 

•40 

•1737 

•2606 

•3909 

•5863 

2 

•65 

•4343 

•2339 

•4678 

•9356 

3 

•80 

•6682 

•1670 

•5010 

1-5030 

4               -87 

•8352 

•1248 

•4992 

1-9968 

5 

•96 

•9600 

•0400 

•2000 

1-0000 

1-00 

1-0000 

•0000 

•0000 

•0000 

Sums 

1-0000 

2-2192 

6-1766 

whence 


T  =  2-2192  Paris  Lines, 
a'  =  1-12  Paris  Lines, 
(T  +  T')/2  =  1-99  Paris  Lines, 
Mean  of  the  two  cr's,  0-97  Paris  Lines. 
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(2)    LINEAR  INTERPOLATION 
for  75  %,  50  %  and  25  %  points. 
80-75      3  -  Q2 


75-65 
65-50 


62 -2' 
2-T 


50-40  T-  1-5 ' 
40-25  1-5  -  Ql 
25-14"  Q1-  I  ' 

g2-r  =0-97 

T  -d=--  049 


Q2  =  2-67, 
T  =  1-70, 
Ql  -  1-21 

(skew). 


Interquartile  Range  =  1-46. 
Semi-interquartile  range  =  0-73. 
(Qi  +  T  +  ()2)/3  =  1-86  Paris  Lines. 

Rough  value  for  a,  0-73/0-6745  =  1-08  Paris  Lines. 


(3)    ARITHMETICAL  MEAN  (Spearman's  Formula) 


8 

P 

dp 

centre  8f 

dpy.s' 

dp  x  *'2 

0 

•00 

•10 

•25 

•0250 

•0062 

0-5 

•10 

•04 

•75 

•0300 

•0225  f 

1 

•14 

•26 

1-25 

•3250 

•4063 

1-5 

•40 

•25 

1-75 

•4375 

•7656 

2 

•65 

•15 

2-5 

•3750 

•9375 

3 

•80 

•07 

3-5 

•2450 

•8575 

4 

•87 

•09 

4-5 

•4050 

1-8225 

5 

•96 

•04 

5-5 

•2200 

1-2100 

6 

1-00 

•00 

I 

1-00             Sums 

2-0625 

6-0281 

Threshold  T  =  2-0625  Paris  Lines. 

Square  of  standard  deviation  =  6-0281  -  T2  =  1-7742, 

Standard  deviation  =  1-33  Paris  Lines*. 


*  Without  Sheppard's  correction,  for  which  see  p.  84. 
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(4)   CONSTANT  PROCESS 
using  Urban's  Tables.    (Appendix  I,  p.  194) 


s 

working  s 

P 

W 

yW 

sW 

8*W 

syW 

0 
0-5 
1 
1-5 
2 
3 
4 
5 
6 

-6 
-5 
-4 
-3 

-2 
0 
2 
4 
6 

•00 
•10 
•14 

•40 
•65 
•80 
•87 
•96 
1-00 

•0000 
•5376 
•6463 
•9768 
•9473 
•7695 
•6215 
•3036 
•0000 

•0000 
-  -4871 
-•4937 
-•1750 
•2581 
•4579 
•4960 
•3759 
•0000 

•0000 
-2-6878 
-2-5853 
-2-9306 
-1-8945 
•0000 
1-2430 
1-2146 
•0000 

•0000 
13-4388 
10-3413 
8-7916 
3-7890 
•0000 
2-4860 
4-8582 
•0000 

•0000 
2-4356 
1-9749 
•5252 
-  -5163 
•0000 
•9900 
1-5036 
•0000 

Sums 

4-8026 

1-5869 
-1-1558 

•4311 

2-4576 
-  10-0982 

-7-6406 

43-7049 

7-4293 
-•6163 
6-9130 

.     -  7-64  x  6-91  -  -431  x  43-7       _ 

4-80  x  6-91  +  -431  x  7-64 
in  working  units  from  the  working  origin 

=  3 ^-  =  2'02  Paris  Lines, 

4-80  x  6-91  +  -431  x  7-64 

4-80  x  43.7  -  7-642         -241  in  working  units, 

whence  a  =  l/(\/2h)  =  2-93  working  units 

=  1-46  Paris  Lines. 

Titchener,  using  the  Constant  Process  with  Miiller  weights  alone 
(the  above  is  with  the  Miiller-Urban  weights),  obtained 

*  T  =  1-88  Paris  Lines, 

h  =  0-49, 

whence  a  =  1-44  Paris  Lines. 

Summarising  in  one  table  we  have : 

Riecker's  Data  calculated  by  different  Processes 

Process  Threshold  T  Scatter  <r 

Limiting  Process  1-99  0-97 


Linear  Interpolation 
Arithmetical  Mean 
Constant  Process 


1-86 
2-06 
2-02 


1-08 
1-33 
1-46 


The  Probable  Error  of  the  Thresholds  calculated  in  these  different  ways. 
The  values  of  the  standard  deviation  given  in  the  table  immediately 
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above  refer  of  course  to  the  variation  of  the  individual  limina  of  which 
the  threshold  T  is  a  central  measure.  If  there  are  n  of  these  individual 
limina,  then  in  an  ordinary  way  the  arithmetical  mean  of  these  would 
have  a  standard  deviation  of  aj^n.  The  standard  deviations  of  the 
above  values  of  T  are  indeed  of  this  order  of  magnitude,  say  in  the 

fir8tcase  0-97M100)  =  -097, 

but  to  avoid  misconception  ought  to  be  regarded  as  distinctly  larger 
than  this.  This  arises  from  various  causes  which  cannot  here  be  gone 
into.  In  the  case  of  the  Limiting  Process  there  is  the  dependence  upon 
the  particular  choice  of  stimuli,  in  the  case  of  Spearman's  Arithmetical 
Mean  formula  there  is  the  uncertainty  about  the  centring  of  the 
"tails"  of  the  distribution,  etc. 

Professor  F.  M.  Urban,  some  twelve  years  ago*,  brought  forward 
reasons  which  in  his  opinion  showed  that  the  Method  of  Limits  was 
much  more  exact  than  the  Constant  Method:  but  his  mathematics 
contained  certain  errors^.  The  various  processes  do  not  differ  very 
widely  in  this  respect  if  each  is  used  in  circumstances  favourable  to 
itself,  but  on  the  whole  the  Constant  Process  is  most  reliable,  and 
Linear  Interpolation  least. 

In  conclusion,  we  may  venture  to  express  a  cautious  opinion  on  the 
choice  of  a  process  of  calculation  from  among  those  given.  Frequently 
of  course  there  is  no  choice,  for  the  conditions  of  experiment  fix  the 
matter  for  us.  For  example,  if  the  just  perceptible  points  have  been 
recorded  but  not  the  frequencies  p  of  answers  of  a  certain  kind  at  each 
stimulus- value,  then  the  direct  Limiting  Process  must  be  employed. 
We  will  suppose  however  that  the  fullest  records  have  been  taken. 

If  the  points  at  which  p  =  0  and  p  =  1,  that  is  the  points  where  only 
one  kind  of  answer  is  given,  are  known  or  are  very  nearly  approached, 
the  Arithmetical  Mean  of  Spearman  is  in  our  opinion  best. 

If  however,  as  is  frequently  the  case,  these  points  are  unknown,  then 
the  simple  linear  interpolation  for  the  25  %,  50  %,  and  75  %  points  is 
a  good  plan  from  the  point  of  view  of  simplicity  and  is  often  of  sufficient 
accuracy. 

If  the  accuracy  of  the  data  justifies  the  use  of  the  full  Constant 
Process,  then  Urban's  Tables  lighten  the  work  enormously,  and  give 

*  "Die  psychophysischen  Massmethoden  als  Grundlagen  empirischer  Messungen," 
Archivf.  d.  get.  Psychol.  1909,  xv.  and  xvi.  pp.  261 — 415. 

t  G.  H.  Thomson,  "Note  on  the  Probable  Error  of  Urban's  Formula  for  the  Method 
of  Just  Perceptible  Differences,"  Brit.  Journ.  Psychol  1913,  vi.  p.  217  and  "The  Accuracy 
of  the  Phi-gamma  Process,"  ibid.  1914,  vn.  p.  44. 
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perfect  accuracy  if  exactly  100  experiments  have  been  made  at  each 
stimulus.  The  great  advantage  of  the  Constant  Process  lies  in  the  fact 
that  the  "tail"  difficulty  does  not  arise.  Experiments  in  which  it  is  on 
psychological  grounds  inadvisable  to  employ  extreme  stimuli  can  only 
be  handled  by  this  process. 

But  it  is  not  worth  while  applying  it  unless  the  calculator  is  assured 
that  the  experimental  accuracy  justifies  it,  and  unless  he  has  convinced 
himself,  by  methods  to  be  described  in  the  next  chapter,  that  the 
distribution  is  not  significantly  skew,  and  the  data  not  heterogeneous. 
Very  few  collections  of  psychophysical  data  are  worth  the  accuracy  of 
the  Constant  Process,  which  is  however  undoubtedly  the  best  theoreti- 
cally for  symmetrical  distributions. 

(5)  DIFFERENCE  THRESHOLDS  AND  THE  PROBABILITY  OF  A 
JUDGMENT  OF  A  CERTAIN  CATEGORY 

A  question  closely  bound  up  with  the  mathematics  of  the  psycho- 
physical  methods  is  that  of  the  best  measure  of  a  subject's  sensitivity 
to  differences  of  stimulus-value. 

To  fix  ideas,  we  shall  use  the  case  already  discussed  of  the  difference 
threshold  for  lifted  weights.  When  a  sufficient  number  of  judgments 
has  been  collected,  the  three  categories  lighter,  equal  or  undecided,  and 
heavier,  are  found  to  occur  with  varying  frequency  with  the  different 
comparison  weights.  The  difference  threshold  is  then  decided  by  the 
positions  of  the  points  T  and  T'  where  the  descending  lighter  and 
ascending  heavier  curves  cross  the  halfway  line  (see  Fig.  7,  p.  59).  The 
distance  (T  —  T')/2  or  some  closely  similar  quantity  is  what  is  called  the 
difference  threshold,  and  is  commonly  used  in  comparing  the  sensitivity 
of  different  subjects.  The  smaller  T  —  T',  the  more  sensitive  the  subject 
is  said  to  be. 

This  distance  however  depends  entirely  on  the  subject's  readiness 
to  give  the  answer  undecided.  It  measures  therefore  rather  a  moral 
characteristic  than  a  physical  sensitivity,  and  varies  very  much  with 
the  instructions  given  to  the  subject.  The  moral  character  of  the 
measure  T  —  T'  is  above  all  seen  from  the  fact  that  any  subject  who 
wishes  may  reduce  it  to  zero,  whatever  may  be  his  actual  sensitivity  to 
differences  of  weight,  simply  by  determining  that  he  will  never  give  the 
answer  undecided. 

There  is  however  another  measure  which  has  been  used.  This  can 
be  most  conveniently  described  by  considering  first  a  case  in  which  a 
subject  gives  no  undecided  answers.  In  such  a  case,  the  thresholds TandT' 
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have  come  together  and  on  the  previous  plan  the  subject's  sensitivity 
would  be  considered  as  infinite,  and  all  subjects  giving  no  undecided 
answers  would  have  the  same  infinite  sensitivity:  whereas  clearly  the 
subject's  sensitivity  is  connected  with  the  rapidity  with  which  the 
curves  pass  from  0  to  1  or  vice  versa,  and  two  subjects  may  differ  very 
much  in  this  respect  even  although  they  both  give  no  undecided  answers. 
Under  these  circumstances  a  measure  which  has  been  used  is  the  distance 
Q  —  Q'>  the  horizontal  distance  between  the  crossing  of  the  -25  and  -75 
lines  (see  Fig.  7,  p.  59).  Under  another  guise  it  was  used  by  Fechner  also 
for  the  cases  where  undecided  answers  were  given.  In  such  cases  he 
reduced  the  three  curves  to  two  by  sharing  the  undecided  answers 
between  heavier  and  lighter. 

This  measure  has  the  advantage  that  the  subject  cannot  increase 
his  apparent  sensitivity  at  will,  as  was  the  case  with  the  "threshold" 
measure.  Q  —  Qr  is  the  interquartile  range  of  the  point  of  subjective 
equality,  represented  by  the  crossing  of  the  heavier  and  lighter  curves. 
It  and  the  difference  threshold  measure  distinctly  different  things, 
and  subjects  placed  in  order  of  merit  by  the  one  will  be  found  in  a 
different  order  by  the  other. 

The  points  here  raised  seem  to  suggest  an  extension  of  Urban's  idea 
of  the  probability  of  a  judgment,  which  compares  the  giving  of  the 
judgments,  heavier,  undecided  or  lighter,  with  drawing  a  ball  from  an 
urn  containing  say  red,  white,  and  blue  balls,  and  ascertaining  its  colour. 
For  each  stimulus  the  urn  is  supposed  to  contain  different  porportions 
of  the  coloured  balls. 

In  place  of  this  is  suggested  the  following.  For  each  stimulus 
imagine  an  urn  containing  an  infinite  number  of  balls  some  black  and 
some  white,  in  a  proportion  varying  in  some  way  with  the  stimulus. 
A  judgment  may  then  be  compared  with  taking  not  one  but  a  handful 
of  balls  from  the  urn,  the  kind  of  judgment  depending  upon  the  pro- 
portion of  black  balls  in  the  handful. 

From  this  point  of  view,  the  standard  weight,  the  variable  weight, 
and  the  physiological  make  up  of  the  subject  decide  the  proportion  of 
black  balls  in  the  urn:  but  the  decision  as  to  what  proportion  is  to  be 
called  heavier,  what  undecided,  and  what  lighter,  depends  upon  a  con- 
scious act  of  the  subject. 


CHAPTER  IV 

SKEWNESS  AND  HETEROGENEITY  IN  PSYCHOPHYSICAL  DATA 

Obvious  skewness  of  many  psychophysical  curves — Pearson's  test  for  goodness  of 
fit  applied  to  the  method  of  average  error — Applied  to  the  method  of  right  and 
wrong  cases — Skew  curves  in  homogeneous  material — The  summation  method  of 
finding  moments — Calculation  of  a  skew  curve — Analysis  into  two  normal  curves — 
Conclusions. 

(1)  OBVIOUS  SKEWNESS  OF  MANY  PSYCHOPHYSICAL  CURVES 
To  anyone  accustomed  to  handling  distribution  data,  a  most  striking 
point  about  the  results  of  many  psychophysical  experiments  is  the 
obvious  skewness  of  much  of  the  data.    Both  the  examples  used  ex- 
tensively in  the  previous  chapter  show  this  very  strongly  as  can  be  seen 
from  an  inspection  of  the  data  either  in  numerical  or  diagrammatic 
form,  and  also  from  the  various  values  of  the  threshold  T  found  by  the 
different  processes  of  calculation :  for  these  differences  arise  largely  from 
the  fact  that  the  distribution  is  not  normal. 

Taking  the  best  of  the  processes,  the  Constant  Process,  it  is  of 
interest  to  see  how  closely  the  curve  which  it  gives  fits  the  original  data. 
A  method  of  thus  estimating  the  goodness  of  fit  of  curves  has  been  given 
by  Professor  Karl  Pearson.  His  method  is  perfectly  general,  and 
applicable  to  all  classes  of  curves*,  but  it  has  been  most  fully  worked 
out  for  the  fitting  of  bell-curves  to  histograms.  Our  problem  is  not  of 
this  nature,  though  it  might  appear  to  be  so,  for  the  pseudo-histogram 
(Fig.  8)  which  can  be  formed  from  the  frequencies  p  differs  essentially 
from  a  real  histogram.  Since  in  psychophysics  it  may  often  be  necessary 
to  fit  curves  to  real  histograms,  for  example  those  obtained  in  the 
Method  of  Average  Error,  we  shall  first  explain  Pearson's  Goodness  of 
Fit  Test  for  this  case,  using  the  bisection  data  of  Chapter  II  for  the 
purpose  (see  pp.  15  and  42  and  Fig.  6). 

(2)  PEARSON'S  TEST  FOR  GOODNESS  OF  FIT  APPLIED  TO  THE 

METHOD  OF  AVERAGE  ERROR 

The  bisection  data  had  a  mean  of  60-13  mms.  and  a  standard  deviation 

of  1-38  mms.    With  these  values,  using  Sheppard's  Tables,  we  draw  the 

smooth  curve  shown  in  Fig.  6.   Now  it  is  important  at  the  outset  to 

*  Phil  Mag.  July  1900,  Fifth  series,  L.  pp.  157—175.    Phil  Mag.  April  1916,  Sixth 

series,  xxxi.  pp.  369—378. 
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realise  that  whether  that  curve  is  a  good  or  bad  fit  to  the  data  depends 
on  the  number  of  observations  made.  The  number  in  this  case  was 
only  29,  and  it  will  presently  be  shown  that  the  curve  is  a  very  good  fit. 
But  had  the  number  of  observations  been  2900  it  would  have  been  a 
bad  fit,  for  with  such  a  number  of  observations  the  histogram  ought  to 
have  modelled  itself  more  closely  to  the  curve. 

In  order  to  apply  Pearson's  test,  we  must  find  the  theoretical  histo- 
gram for  comparison  with  the  observed  histogram.  That  is,  we  must 
find  the  areas  of  the  slabs  of  the  curve  in  Fig.  6  which  replace  the 
rectangles.  (One  of  these  slabs  is  cross-hatched  in  that  figure  to  explain 
more  clearly  what  is  here  meant.)  This  is  most  easily  done  from 
Sheppard's  Tables  by  calculating  the  areas  of  the  smooth  curve  from 
—  oo  up  to  each  dividing  ordinate  in  turn,  and  taking  the  differences 
of  these  numbers,  as  is  done  in  the  following  table.  The  quantity 
|  (1  +  a)  in  Sheppard's  Tables  is  the  area  of  a  Normal  Curve,  of  unit 
total  area,  up  to  the  ordinate  xja.  For  negative  re's  it  has  to  be  subtracted 
from  unity. 

Calculation  of  the  Theoretical  for  comparison  with  the  Observed 
Histogram  of  the  Bisection  Data 


x  mms. 

x'=x-  60-13 

Sheppard's 

*=*'/!  -38* 

Sheppard's 
*(!+«) 

Multiplied 
by  29* 

Differences 

0-09 

63-95 

3-82 

2-77 

•997 

28-91 

0-52 

62-95 

2-82 

2-04 

•979 

28-39 

2-09 

61-95 

1-82 

1-32 

•907 

26-30 

5-36 

60-95 

0-82 

0-59 

•722 

20-94 

7-95 

59-95 

-0-18 

-0-13 

•448 

12-99 

7-33 

58-95 

-1-18 

-0-86 

•195 

5-66 

4-01 

57-95 

-2-18 

-1-58 

•057 

1-65 

1-36 

56-95 

-3-18 

-2-31 

•010 

0-29 

0-29 

29-00 

The  actual  and  theoretical  histograms  are  then  compared  in  the  next 
table.  Pearson's  method,  the  theory  of  which  cannot  be  given  here, 
then  forms  the  quantity 

2  _  o       /square  of  differences  of  theoretical  and  observed  frequencies'^ 
\  theoretical  frequency  / ' 

With  this  value  x2,  and  n'  the  number  of  cells  in  the  histogram, 
Table  XII  in  Pearson's  Tables  is  then  entered  and  a  value  of  P  found. 

*  These  can  be  calculated  at  one  opening  of  CrelJe's  Tables,  or  by  one  setting  of  a 
slide  rule. 
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ri  is  here  9  counting  in  the  two  tail  cells.  P  is  then  the  probability  that 
the  observed  or  a  worse  distribution  will  be  obtained  (assuming  the 
theoretical  distribution)  in  a  sample  of  the  size  taken,  here  29. 

Calculation  of  Goodness  of  Fit  of  Normal  Curve  to  the  Bisection  Data 


mms. 

Actual  observa- 
tions in 

Theoretical  m' 

(m-m')*/mf 

Above         63-9 

0 

0-09 

•090 

83—63-9 

1 

0-52 

•443 

62—62-9 

2 

2-09 

•004 

61—61-9 

6 

5-36 

•076 

60—60-9 

6 

7-95 

•478 

59—59-9 

8 

7-33 

•061 

58—58-9 

5 

4-01 

•244 

57—57-9 

1 

1-36 

•095 

Below  57 

0 

0-29 

•290 

29 

29-00 

1-781  =X* 

No.  of  cells  n'  =  9. 

From  Elderton's  Tables,  No.  XII  in  Pearson's  Tables, 

P  =  0-998  for  x2=l, 
0-981  for  x2  =  2, 
therefore  P  =  0-99  approximately. 

In  our  case  we  find  P  =  0-99,  i.e.  99  samples  of  this  normal  distri- 
bution out  of  100  would  give  no  better  a  fit  than  the  present.  The 
Normal  Curve  is  therefore  a  perfectly  satisfactory  theory  for  these 
29  observations. 

(3)  PEARSON'S  TEST  OF  GOODNESS  OF  FIT  APPLIED  TO  THE 
METHOD  OF  RIGHT  AND  WRONG  CASES 

The  above  plan  of  testing  goodness  of  fit  cannot  however  be  applied 
to  the  pseudo-histogram  of  Fig.  8*.  The  reasons  why  this  is  so  cannot 
be  here  gone  into  in  detail,  but  they  are  based  upon  the  following 
differences  between  the  two  cases.  In  a  real  histogram,  if  any  one  of 
the  cells  is  larger  than  it  ought  to  be,  then  any  other  must  have  a 
tendency  to  be  smaller  than  it  ought  to  be.  There  is  a  strong  negative 
correlation  between  the  numbers  in  the  cells,  a  correlation,  that  is,  from 
trial  to  trial.  In  the  psychometric  pseudo-histogram,  however,  formed 
from  the  proportions  p,  this  is  otherwise,  because  the  p's  are  measured 
quite  separately  from  one  another.  In  a  real  histogram  the  numbers  in 

*  G.  H.  Thomson,  "The  Criterion  of  Goodness  of  Fit  of  Psychophysical  Curves," 
Biometrika,  1919,  xn.  pp.  216—230. 
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each  cell  are  necessarily  positive  quantities.  In  the  psychometric 
pseudo-histogram  they  may  be  negative,  if  the  p's  do  not  rise  steadily. 
Psychometrical  data  of  the  kind  here  considered,  in  fact,  as  has 
already  been  pointed  out,  are  not  really  in  histogram  form.  Although 
a  kind  of  histogram  can  be  deduced  from  them,  it  is  only  by  making 
certain  assumptions,  and  the  intercorrelations  of  the  cells  of  this  arti- 
ficial histogram  are  different  from  the  intercorrelations  of  a  naturally 
observed  histogram.  Under  these  circumstances  we  must  in  applying 
the  goodness  of  fit  test  turn  to  the  directly  observed  quantities  p.  These 
are  compared  with  the  values  calculated  from  the  Constant  Process  in 
the  following  table,  the  remaining  columns  of  which  are  explained  below: 


Calculation  of  Goodness  of  Fit  of  a  Constant  Process  Ogive 

Urban's  Subject  I,  answers  heavier 


Grams  * 

Observed  p 

Calculated  p' 

P-P' 

(P-P'W* 

84 
88 
92 
96 
100 
104 
108 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•0013 
•0122 
•0697 
•2394 
•5246 
•7972 
•9454 

•0009 
•0078 
•0192 
-•0172 
-•1113 
•0984 
-•0054 

•001 
•005 
•006 
•002 
•050 
•060 
•001 

•125 

=S{(p-p'Wp'q'} 

X2  =  450  x  -125  =  56-25, 

n'  =  8  (one  more  than  the  number  of  stimuli), 

P  =  -0000005  from  Pearson's  Tables. 

To  these  quantities  p  the  principles  underlying  Pearson's  test  can 
be  applied  direct.  They  are  indeed  the  same  principles  already  used  in 
Chapter  II  when  we  were  discussing  curve  fitting.  We  have  -n  quantities 
p  which  are  independently  measured,  and  n  quantities  p'  which  are 
theoretically  given.  The  variations  of  p  from  p'  are  binomial  in  form, 
that  is  approximately  normal.  If  we  look  upon  the  judgment  heavier, 
as  suggested  in  an  earlier  paragraph,  as  being  comparable  with  drawing 
black  balls  out  of  a  bag  containing  black  balls  and  white  balls  in  the 
proportions  p'  and  1  —  p',  then  the  probable  error  of  p  is 


k  being  the  number  of  judgments  of  which  pk  are  of  the  category  heavier. 
For  the  chances  of  obtaining  0,  1,  2,  ...  ,  k  —  1,  or  k  black  balls  in 
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a  drawing  of  k  are  given  by  the  terms  of  the  binomial  (pf  -f  q')k,  q'  being 
1  —  pf  ;  that  is,  the  chances  of  obtaining 

p  =  0/&,  1/k,  ...(k-  l)/k,  k/k  or  unity. 
The  standard  deviation  of  the  above  binomial  is  V(&PY)>  an(*  tne 


standard  deviation  of  p  therefore  ^  -\/(k>p'q'),  or  ^/(p'q'lk).   The  proba- 
bility of  an  error  p  —  p'  is  therefore 


The  probability  of  the  whole  set  of  observed  values  pl9  p2,  ...  p 
occurring  is  the  product  of  n  such  factors,  and  is  of  the  form 


or  if  k  is  the  same  at  each  stimulus, 


The  remaining  columns  in  the  above  table  calculate  this  quantity*, 
which  is  the  same  as  Pearson's  x2>  under  our  special  circumstances. 
For  n'  we  have  to  use  n  -f-  1,  because  all  the  n  values  of  p  can  vary 
separately,  whereas  in  a  real  histogram  the  number  of  variables  is  one 
less  than  the  number  of  cells.  We  thus  reach  the  value  P  =  -0000005, 
that  is,  the  curve  is  an  incredibly  bad  fit  to  the  data,  which  cannot 
possibly  be  regarded  as  differing  from  a  normal  distribution  by  sampling 
errors  alone. 

(4)  SKEW  CURVES  IN  HOMOGENEOUS  MATERIAL 

There  are  two  immediate  hypotheses  which  present  themselves  to 
explain  this  bad  fit  of  the  normal  curve,  (a)  that  the  material  is  not 
homogeneous,  the  conditions  of  experiment  not  having  remained  the 
same  throughout,  (6)  that  the  material  is  homogeneous,  but  the  under- 
lying factors  which  cause  the  distribution  of  errors  or  deviations  are 
not  independent,  but  correlated,  as  in  the  system  of  generalised  proba- 
bility curves  next  to  be  described.  A  word  of  caution  may  first  be  given, 
for,  as  we  have  already  suggested,  the  identification,  or  even  the 
approximation,  of  graphical  representations  of  "percentage"  judgments 
to  true  frequency-distributions  is  of  extremely  doubtful  validity. 

*  A  table  in  Appendix  I,  giving  values  of  l/(pq),  lightens  the  work  considerably. 

B.  &  T.  6 
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The  general  theory  of  curve  fitting  has  been  worked  out  in  great 
detail  by  Professor  Karl  Pearson*.  A  good  account  of  his  method  is 
given  in  a  book  by  W.  Palin  Elderton,  Frequency  Curves  and  Correlation, 
C.  and  E.  Layton,  London,  1906,  to  which  the  mathematical  reader  is 
referred  for  fairly  complete  theoretical  and  practical  information. 

Frequency  curves  of  data  not  involving  a  mixture  of  species  tend 
to  commence  at  zero,  rise  to  a  maximum,  and  then  fall  either  at  the 
same  or  at  a  different  rate.  There  is  often  high  contact  at  one  or  both 
ends  of  the  distribution.  An  equation  of  the  general  form 

dy  =  (x  +  a)  y 

dx         f(x) 

satisfies  both  these  conditions,  since  if  y  =  0,  dy/dx  =  0  (high  contact), 
and  if  x  =  —  a,  dy/dx  =  0  (maximum;  for  a  maximum,  again,  the 
second  differential  coefficient  must  be  negative).  Expanding  /  (x)  by 
Maclaurin's  Theorem,  we  have 

<%  = (x  +  a)  y 

dx      c0  +  CjX  +  c2xz  +  Cgic3  +  ... ' 

(1)  Putting  cx  =  c2  =  c3  =  ...  =  0,  we  have 

1  dy  _  x  -f-  a 

ydx         Co~ 

which  is  the  Gaussian  or  normal  curvef.  It  fits  the  symmetrical  binomial 
(J  +  l)n>  e-g-  m  com  tossing,  where  the  chances  for  and  against  are 
equal  (p  =  q),  and  the  contributory  causes  are  independent  of  one 
another. 

(2)  Putting  c2  =  Cg  =  ...  =  0,  we  have 

1.  dy  _    x  +  a 

ydx~  cQ  +  c±x  ' 

which  represents  a  class  of  curves  varying  from  the  Gaussian  curve  to 
the  J-curve.  It  fits  the  asymmetrical  or  "point"  binomial  (p  +  q)n,  e.g. 
in  teetotum  spinning  or  dice  throwing,  where  the  chances  for  and 
against  are  not  equal,  p  ^  q,  but  the  contributory  causes  are  still 
independent  of  one  another. 

*  Karl  Pearson,  "Skew  Variation  in  Homogeneous  Material,"  Phil  Trans.  1895, 
CLXXXVI.  A,  pp.  343  ff. ;  "  On  the  Systematic  Fitting  of  Curves  to  Observations  and  Measure- 
ments," Biometrika,  I.  pp.  265  ff.  and  n.  pp.  1  ff.,  1901 — 3;  "On  the  Curves  which  are 
most  suitable  for  describing  the  frequency  of  Random  Samples  of  a  Population," 
Biometrika,  1906,  v.  pp.  172 — 5  (an  exceedingly  clear  summary  of  the  principles  involved). 
Also  later  papers  in  Phil  Trans.  1901,  cxcvn.  A,  pp.  443 — 459,  and  1916,  being  supplements 
to  the  first  mentioned  memoir.  See  also  pp.  Ix  to  Ixx  in  Pearson's  Tables  for  Statisticians 
and  Biometricians,  Cambridge,  1914. 

f  Compare  equation  (5),  Chap.  II,  p.  34. 
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;4  =  . . .  =  0,  we  have 

1  dy  _         x  +  a 
y  rfx      c0  +  Cj£  - 


(3)    Putting  c3 


which  can  be  made  to  represent  almost  all  the  frequency  distributions 
which  may  arise.  It  fits  the  hypergeometrical  series,  the  successive 
terms  of  which,  e.g.  give  the  chances  of  getting  k,k  —  1,  ...  0  black  balls 
from  a  bag  containing  pn  black  and  qn  white  balls  when  k  balls  are 
drawn*. 

Here  the  contributory  causes  are  not  independent  of  one  another. 

There  is  no  advantage  in  employing  equations  which  involve  Cg  and 
higher  constants,  because  their  use  necessitates  the  calculation  of  the 
6th  and  higher  "moments,"  and  these  have  very  high  probable  errors. 

Definition.  The  nth  moment  coefficient  (//,„')  of  any  distribution  about 
any  ordinate  is  the  sum  of  the  products  of  the  partial  frequencies  and 


Centroid 
vertical 


axis  of  x 


Fig.  10 


the  wth  power  of  the  distances  of  these  frequencies  from  the  ordinate, 
divided  by  the  total  frequency.  In  symbols,  if  N  be  the  total  frequency, 


NI**' ~]  #yte> 

The  moments  are,  in  practice,  first  calculated  about  any  arbitrary 
ordinate  that  is  most  convenient,  and  then  reduced  to  moments  about 
the  centroid  in  the  following  way  (dashed  ju,'s  represent  moments  about 
an  arbitrary  ordinate,  undashed  /A'S  about  the  central  ordinate)  : 


=     (x  -  x)«yox  =  N^n'  - 
This  gives  the  general  reduction  formula 


,  ,  ,          n  (n  —  1)     ,2  , 

=  Mn    ~  ^Ml  /*  n-l  +   —  J^  -  ^  V  «-2  ~  •••• 


*  The  series  is 
pn  (pn-l) ...  (pn  — 

n(n-l)  ...(n-k  +  l) 
Other  series  may  arise. 


k(k-i) 
21 


qn(qn-l) 


-k  +  l)(pn-k+2) 


6—2 
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which  enables  us  to  transfer  any  moment  from  an  arbitrary  ordinate 
to  the  mean.  Thus  we  have: 


The  symbols  p  represent  moments  of  the  curve;  but  we  have  to 
start  with  grouped  frequencies,  where  the  frequencies  are  assumed  to 
be  concentrated  along  the  mid-ordinates  of  the  rectangles  (cf.  Fig.  10  B). 
The  moments  obtained  from  these  grouped  frequencies  are  denoted  by 
v's  (dashed  and  undashed),  and  corrections  are  necessary.  These  have 
been  deduced  by  Sheppard*  and  are  consequently  known  as  Sheppard's 
adjustments.  They  are: 

to  =  "2  ~  T?> 

/*3  ="S» 

/*4  =  "4  ~  i"2  +  *fc- 

It  is  generally  said  that  they  are  only  valid  when  there  is  "high 
contact  "  at  the  ends  of  the  frequencies,  but  the  equations  for  /i2  and  p,3 
are  probably  still  valid  even  without  high  contact,  if  the  terminal 
frequencies  are  zero. 

(N.B.  In  working,  v"s  are  changed  into  i/s  before  applying  Sheppard's 
corrections.) 

It  is  obvious  that  VQ  =  I  and  vl  =  0.  i>/  is  the  distance  of  the  mean 
or  centroid  vertical  from  the  arbitrary  ordinate  about  which  the  moments 
are  first  taken,  and  is  conveniently  known  as  d. 

Two  very  important  constants  in  curve  fitting  are 


The  values  of  these  are  always  to  be  calculated,  and,  within  the  limits 
of  their  probable  errors,  they  fix  the  type  to  which  the  curve  belongs. 
The  general  frequency  curve  equation,  written  in  terms  of  moments,  is 

c  ,  g  Vft     (A  +  3) 

ldy_  __  2    5fe-6fc-9  _ 

ydx"    '      (      4&-3&          Vft     (0,  +  3)      x      2/32  -  3ft  -  6  ,x\  »]  • 
r  110^-12^-18^"    2  5ft-  6ft-  9  "a"^  10ft  -12)8!  -18  W  ) 


*  W.  F.  Sheppard,  "On  the  Calculation  of  the  most  Probable  Values  of  Frequency 
Constants,  for  Data  arranged  according  to  Equidistant  Divisions  of  a  Scale,"  Proc.  Lond. 
Math.  Soc.  xxix.  pp.  363  ff.  Karl  Pearson,  "On  an  Elementary  Proof  of  Sheppard's 
Formulae  for  correcting  Raw  Moments  and  on  other  allied  Points,"  Biometrika,  1904,  m. 
pp.  308  ff  . 
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This  gives  at  once,  for  the  distance  between  the  mean  and  the  mode, 


(origin  is  at  mean). 

Hence  the  curve  is  symmetrical  if  fa  =  0.  If  fa  =  0  and  fa  =  3,  the 
curve  reduces  to  the  Gaussian  or  Normal  Curve,  since  the  terms  in- 
volving x  in  the  denominator  of  the  right-hand  side  of  the  general 
frequency  curve  equation  then  vanish. 

In  using  the  Pearsonian  method,  then,  the  order  of  procedure  to  be 
adopted  is  : 

(1)  Calculate  the  moment  coefficients  v^,  v2',  i/3',  v±   about  a  con- 
venient arbitrary  ordinate, 

(2)  Transfer  to  the  mean  by  the  equations 


v^  =  „/  _  4^  V  +  6iV  V  ~  V4. 

(PI  or  d  is  the  distance  of  the  mean  from  the  arbitrary  ordinate.) 

(3)  Determine  the  corresponding  moments  for  the  curve  by  the 
equations 

f*2  =  "2  ~  T*  ] 

/zj  =  i/3  >  Sheppard's  corrections. 

/*4  =  "4  -  ¥2  +  rfj 

(N.B.    For  these  corrections  to  be  applicable,  two  conditions  must  be 
fulfilled: 

(i)  there  must  be  high  contact, 

(ii)    the  grouping  of  the  frequencies  must  be  equal.) 

(4)  Calculate  fa  and  fa  by  the  equations 

R       ^     R        P* 

&  =  S-3.&  =  -2. 

These  results  give  the  distance  of  the  mean  from  the  arbitrary  ordinate 
(i//  or  d)  ;  the  standard  deviation  (Vfa)  J  an(i  the  mode 

mean       _fV&_(&±3L 
-  2  (5&  -  6&  -  9)  ' 

The  median  is  more  difficult  to  determine  exactly,  but  a  position 
which  is  approximate  and  indeed  very  accurate  in  all  the  curves  we  are 
likely  to  meet  with  in  this  work,  is  between  the  mean  and  the  mode,  but 
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nearer  to  the  mean,  so  that  the  distance  from  the  mode  is  twice  the 
distance  from  the  mean*. 

Mean        Median        Mode 


1/3  2/3 

The  investigator  may  then  proceed  to  determine  to  which  of  the 
Pearsonian  "types"  the  particular  curve  belongs,  to  find  its  equation, 
and  to  plot  it.  The  type  is  decided  by  the  constants  ^  and  /?2  (using 
Diagram  XXXV,  p.  66,  Pearson's  Tables)  or  by  the  criterion  /c2  where 


using  the  following  diagram: 


Typel 


TypeV 


Type  VI 


The  equations  to  the  types  are  given  in  detail  in  Pearson's  Tables, 
p.  Lxiii,  or  in  Elderton  (op.  rit.),  where  valuable  advice  on  the  arrange- 
ment of  the  calculations  will  be  found.  Elderton's  Type  II  includes 

*  Karl  Pearson,  Biometrika,  i.  1902—3,  p.  265;  Phil.  Trans.  CLXXXVI.  A,  p.  375;  Roy. 
Soc.  Proc.  Lxvra.  p.  369 b.  C.  V.  L.  Charlier,  "Researches  into  the  Theory  of  Probability," 
Lund*  Universitets  Arsskrift,  1905  (1),  1,  equation  9.  Arthur  T.  Doodson,  "Relation  of  the 
Mode,  Median,  and  Mean  in  Frequency  Curves,"  Biometrika,  1917,  xi.  p.  425. 
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both  our  Types  II  and  VII,  and  his  Type  VII  is  our  G,  the  Normal 
Curve. 

We  shall  confine  ourselves  to  a  fuller  description  of  Type  IV,  which 
appears  to  be  the  type  most  common  in  psychometric  skewness*. 

It  will  be  clear  from  the  above  account  that  the  whole  of  the  calcula- 
tions are  based  upon  the  first  four  moments  of  the  data,  and  we  proceed 
first  to  describe  the  most  convenient  way  of  finding  these  when  as  here 
the  stimuli  are  equidistant,  viz.  the  summation  method  already  used 
for  finding  the  mean  on  p.  19.  If  the  stimuli  are  not  equidistant  the 
calculations  are  rather  longer. 


(5)  THE  SUMMATION  METHOD  OP  FINDING  MOMENTS 
IN  THE  CASE  OF  DATA  AT  EQUIDISTANT  POINTS 

A  full  description  of  this  device  will  be  found  in  Mr  Palin  Elderton's 
book  already  cited,  where  it  is  attributed  to  Mr  G.  F.  Hardy.  It  has 
however  been  independently  used  by  numerous  writers,  e.g.  Lipps, 
Wirth,  Urban,  etc.f 

The  theory  cannot  be  worked  out  here,  but  the  reader  can  easily  do 
so  for  himself.  It  is  only  a  question  of  simple  algebra ;  or  from  another 
point  of  view  it  is  the  same  thing  as  integration  by  parts.  We  give  only 
a  worked  example,  Urban's  Subject  I,  heavier  answers. 


Example  of  the  Use  of  the  Summation  Method 


Grams 

p                  series  of  successive  sums 

84 
88 
92 
96 
100 
104 
108 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•0022 
•0222 
•1111 
•3333 
•7466 
1-6422 
2-5822 

•0022 
•0244 
•1355 
•4688 
1-2154 
2-8576 
5-4398 

•0022 
•0266 
•1621 
•6309 
1-8463 
4-7039 
10-1437 

2-5822 
£2  ord 

5-4398 

S3 

10-1437 

s* 

17-5157 

£5 

*  Thirteen  out  of  fifteen  psychometric  curves  tried  recently  were  Type  IV.  Of.  Pearson 
on  zoological  and  anthropological  curves,  where  Type  IV  also  prevails,  Phil.  Trans.  1895, 
CLXXXVI.  A,  Part  I,  pp.  388,  403  and  411. 

t  G.  F.  Lipps,  "Die  Theorie  der  Collectivgegenstande,"  Wundt's  Phil.  Stud.  Bd.  xvn. 
Separat-Abdruck,  Leipzig,  1902;  W.  Wirth,  "Die  mathematischen  Grundlagen  der  soge- 
nannten  unmittelbaren  Behandlung  psychophysischer  Resultate,"  Wundt's  Psychol. 
Studien,  1910,  Bd.  vi.  pp.  141,  252,  430.  Urban  on  Wirth,  Archiv  f.  d.  ges.  Psychol  xx. 
Literaturbericht,  p.  1. 
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The  table  is  self-explanatory.  Each  column  is  formed  from  the 
preceding  one  by  successive  summations  (from  the  top  in  this  case), 
and  is  then  totalled*.  The  origin  is  here  the  centre  of  the  group  beyond 
108  grams,  viz.  110  grams,  and  the  unit  of  measurement  is  4  grams, 
measured  downwards  from  110  grams.  We  have 

Mean  =  110  -  4d  =  99-6712  grams. 
Further,  it  can  be  shown  that  the  moments 

1/2  =  2S3  -  d  (1  +  d), 

v3  =  6S4  -  3i/2  (1  -I-  d)  -  d  (1  +  d)  (2  +  d), 

i>4  =  24S6-2i/3  (2  (!  +  <*)  +  !} 

-  v2  {6  (1  +  d)  (2  +  d)  -  1}  -  d  (1  +  d)  (2  +  d)  (3  +  d). 

The  work  is  not  heavy  up  to  this  point  if  arranged  systematically, 
and  in  the  present  case  it  gives 

1/2  =  1-6296, 
1/3  =  0-9645, 
v±  =  9-1621, 

or,  using  Sheppard's  corrections, 

^  =  1-5463,  a  =  V>2  =  1-2435, 

to  =  0-9645, 
/i4  =  8-3765. 

This  a  gives  in  original  units  4  x  1-2435  =  4-974  grams,  differing  from 
the  value  mentioned  on  p.  63  because  of  the  Sheppard  adjustment. 
From  the  moments  we  obtain 

ft  =  0-251, 
ft  =  3-51, 
k2  =  0-775. 

The  type  is  therefore  Type  IV;  within  the  limits  of  probable  error 
of  ft,  ft,  and  K2  it  might  however  be  Type  G,  VII,  or  V.  Unfortunately 
the  ordinary  methods  of  finding  these  probable  errors  are  of  doubtful 
significance  in  the  case  of  pseudo-histogram  data  such  as  ours.  We  turn 
to  the  calculation  of  Type  IV. 

*  Slightly  greater  accuracy  could,  by  the  way,  be  attained  by  using  the  actual  numbers 
of  answers  heavier  and  not  the  proportions  p  in  cases  like  the  present  where  an  awkward 
number  of  experiments  was  performed  at  each  stimulus,  viz.  450,  leading  to  recurring 
decimals.  The  totals  would  then  be  divided  by  this  number. 
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(6)  CALCULATION  OF  A  SKEW  CURVE  (Type  IV) 

The  equation*  is 

/.       x2\-m 
e 


rl/n  say, 


89 


wherein  m  =  J  (r  -f  2)  say, 

where  r  =  6  (B9  -  &  - 


"U   TT" 


$&  (skewness)  =  Z/(4w). 
Origin,  at  mean  +  va/r; 
mode,  at  mean  —  skewness  x  or; 

COS20 

N    Vr    e  3r 


127 


tan 


a  V(2?r)  (cos  «^)r+ 
The  actual  form  of  the  calculation  depends  on  the  appliances  avail- 
able, whether  Crelle's  Tables,  or  logarithms,  or  calculating  machines. 
Using  Crelle  and  seven  figure  logarithms  we  get  finally,  after  much 
labour,  the  following  values  for  the  ogive,  which  has  to  be  obtained 
from  the  bell-curve  by  simple  quadrature: 

Urban' s  Subject  /,  Heavier  answers 


Stimulus 

Observed  p 

Calculated  by 
Type  IV 

(p-fOVW) 

84 
88 
92 
96 
100 
104 
108 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•0046 
•0184 
•0690 
•2155 
•5006 
•8078 
•9624 

•0012 
•0001 
•0062 
•0002 
•0305 
•0496 
•0139 

Sum  -1017 

X2  _  450  x  4017  =  45-8,     P  <  -0000005  still. 

We  find  therefore  that  fitting  the  best  Pearson  curve  possible  to  the 
data  makes  practically  no  improvement  in  the  fit,  which  is  still  so  very 
bad  as  to  make  it  quite  certain  that  the  data  are  not  homogeneous  at 
all.  The  fit  cannot  be  much  improved  by  other  assumptions  as  to  the 
spread  of  the  "tail,"  several  of  which  have  been  tried.  The  Type  IV 

*  The  notation  on  this  page  is  that  of  Elderton  following  Pearson,  and  the  symbols 
m,v,n  and  r  have  no  connection  with  these  symbols  used  elsewhere  in  the  present  book. 
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curve  itself  is  shown,  contrasted  with  the  pseudo-histogram,  in  Fig.  12. 
The  mean  (99-67),  median  (99-97)  and  mode  (100-52)  are  worth  com- 
paring, as  a  matter  of  interest,  with  the  thresholds  obtained  otherwise 
(see  pp.  53,  60,  63  and  70). 

The  reason  for  the  bad  fit  in  this  case  is,  mathematically,  the  im- 
possibility of  finding  a  curve  to  accommodate  both  the  tall  rectangle 
217,  and  the  tail  of  27,  however  the  latter  may  be  allocated.  Much  of 
Urban's  other  data  shows  the  same  bad  fit,  and  for  the  same  reason,  the 
size  of  the  "tails."  Not  all  however  are  bad.  The  best  case  is  Subject  II 


Mean     »    99-67  grs. 
Mode     -  100-52  „ 
Median  *    99-97   . 


84          88          92          96         100        104         1( 
Fig.  12.  A  Type  IV  curve  fitted  to  Urban's  data  for  Subject  I,  heavier 
answers,  in  pseudo-histogram  form 

(Urban  himself)  who  had  had  much  practice  at  this  form  of  experiment. 
The  lighter  answers  in  his  case  were  as  follows,  compared  (1)  with  a 
curve  fitted  by  the  Constant  Process  and  (2)  with  a  Pearson  skew  curve 
(here  Type  I). 

Urban's  Subject  77,  Lighter  answers 


Grams 

Observed  p 

Normal  Curve  p 

Type  I  Curve  p 

84 
88 
92 
96 
100 
104 
108 

•9333 

•8622 
•7000 
•4489 
•2311 
•0956 
•0156 

•9504 
•8540 
•6767 
•4456 
•2320 
•0922 
•0272 

•9432 

•8520 
•6875 
•4627 
•2379 
•0858 
•0187 

On  testing  the  goodness  of  fit  we  obtain 

For  Normal  Curve,  P  =  0-48. 
For  Type  I  Curve,  P  =  0-91. 
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Here  the  Gaussian  is  a  good,  and  Type  I  an  excellent  fit.  There  is  every 
reason  then  to  think  that  the  data  here  are  homogeneous.  The  bell- 
curve  and  pseudo-histogram  are  shown  in  Fig.  13. 

Since  we  have  decided  that  the  data  of  Urban's  Subject  I  are  hetero- 
geneous, that  is,  that  the  conditions  of  the  experiment  varied  consider- 
ably during  its  performance  (which  lasted  several  months),  the  question 
naturally  arises  as  to  whether  we  can  analyse  the  data  mathematically 
into  two  or  more  frequency-distributions.  This  question  was  discussed  by 
Professor  Pearson  in  1894*  as  far  as  an  analysis  into  two  normal  curves 
goes.  One  more  moment,  /Lt5,  is  needed  and  as  the  probable  error  of  this  is 
considerable  the  practical  application  of  the  plan  to  be  described  is  not 
very  satisfactory. 

1-200 


-100 


80          84          88          92          96         100        104        108 
Fig.  13.   A  Type  I  curve.   Urban's  Subject  II,  lighter  answers 

(7)  ANALYSIS  INTO  TWO  NORMAL  CURVES 
Stage  I.    Find  the  centroid  of  the  frequency  curve  and  calculate 


A5  =  SO/z^  —  3/z5. 

Stage  II.    Solve  the  following  nonic  equation  for  p2  using  Sturm's 
functions  to  localise  roots: 

36/vV  -  (24/^A5  -  10A42)  pf 
A4  +  2A62)  p2*  +  (288^  -  12A.V.  -  V)  ?23 
5  -  7^*)  P22  +  32/i,«AiP«  -  2  W  =  0, 
and  find  p1  from 


*  Phil  Trans.  Hoy.  Soc.  London,  1894. 
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Ayx  and  ^y2  are  the  positions  of  the  axes  of  the  normal  component  curves, 
where  h  is  the  unit  of  length. 

Stage  IV.    The  fractions  zx  and  z2,  that  the  areas  of  the  component 
curves  are  of  the  area  of  the  whole  curve,  form  the  roots  of  the  quadratic 


Pi    ~     P2 

Stage  V.    The  standard  deviations  are  found  from 


80         84          88         92          96         100        104        108        112  grams 

Fig.  14.   Pseudo-histogram  analysed  into  two  normal  curves,  with  their  sum. 
Urban's  Subject  I,  heavier  answers  (fit  still  very  poor) 

The  above  refers  to  asymmetrical  frequency  curves.  For  symmetrical 
curves  (where  the  components  have  the  same  axis)  modifications  are 
required. 

On  carrying  out  this  very  troublesome  calculation  with  the  hetero- 
geneous data  of  Urban's  Subject  I,  heavier  answers,  the  nonic  was  found 
to  have  three  possibly  usable  roots :  but  two  of  these  led  to  imaginary 
quantities,  leaving  only  a  root  which  led  to  the  following  analysis, 
approximately: 

First  component  Second  component 

Centre         98       grams  101-7    grams 

a-                    5-73      „  3-54     „ 

Area          250  200 


CH.  IV] 


SKEWNESS  AND  HETEROGENEITY 


93 


The  two  curves  are  drawn  on  the  adjoining  figure.  On  adding  them 
and  finding  their  areas  between  the  appropriate  points  (84,  88  grams, 
etc.)  we  arrive  at  the  following  comparison: 

Urban' s  Subject  I,  Heavier  answers 


Grams 

p  observed 

p'  of  Compound 

(p-p'YI(pfqf) 

84 
88 
92 
96 
100 
104 
108 

•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 

•0039 
•0228 
•0836 
•2262 
•4938 
•8031 
•9603 

•001 
•000 
•000 
•000 
•026 
•054 
•016 

•097=  sum 

X2 = 450  x  -097  =43-65. 

The  decrease  in  the  value  of  x2  obtained  by  dissecting  into  two 
normal  curves  is  therefore  only  very  slight  indeed.  P  is  still  only  about 
•000001  in  value.  The  heterogeneity  is  more  complex  than  can  be  thus 
dealt  with. 

(8)  CONCLUSIONS 

Further  analysis  is  useless.  For  since  there  are  only  seven  points 
given  by  experiment  in  the  curve,  and  also  the  total  area,  it  is  clear 
that  an  exact  fit  could  be  obtained  by  two  skew  curves,  or  a  skew  and 
a  normal,  which  have  between  them  eight  constants;  or  in  many  ways 
by  three  normal  curves. 

All  we  can  say  is  that  this  subject's  sensitivity  oscillated  between 
at  least  two  states;  and  if  only  two,  then  one  of  these  is  such  as  to  give 
a  skew  curve.  Without  attempting  to  get  an  exact  fit,  it  is  clear  after 
our  experience,  that  provided  we  supply  a  flat  normal  curve  to  give 
the  awkward  tails,  the  surplus  of  the  distribution  could  be  fitted  success- 
fully by  a  skew  curve. 

This  suggests  that  the  two  components  into  which  the  heterogeneous 
data  are  thus  divided  are  (a)  a  component  due  to  erratic  answers  giving 
a  wide  shallow  distribution  and  (6)  a  component  really  corresponding 
to  the  conditions  of  the  experiment. 

The  use  of  "catch"  tests  in  threshold  determination  is  to  check 
component  (a).  The  proper  way  to  employ  them  is  to  cancel  sittings 
at  which  numerous  catch  errors  occur,  the  subject  being  then  pre- 
sumably in  an  abnormal  state. 

In  Riecker's  experiment  many  "catch  errors"  were  present,  i.e. 
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occasions  when  the  subject  answered  two  to  a  one-point  stimulus.  Fitting 
the  best  normal  curve  to  his  data,  we  only  get,  on  testing  its  goodness 
of  fit,  P  =  -0003. 

Riecker's  Data 

Mean    2-02,        0-  =  1-46  Paris  Lines 


Paris 
Lines  « 

Observed  p 

«-2-02 

*-2-02 

Gaussian  p' 

p-p> 

(p-p')*lp'qf 

1-46 

0 

•00 

-2-02 

-1-38 

•084 

-•084 

•092 

0-5 

•10 

-1-52 

-1-04 

•149 

-•049 

•019 

1 

•14 

-1-02 

-0-70 

•242 

-•102 

•057 

1-5 

•40 

-0-52 

-0-36 

•359 

•041 

•007 

2 

•65 

-0-02 

-0-014 

•494 

•156 

•097 

3 

•80 

0-98 

0-67 

•749 

•051 

•014 

4 

•87 

1-98 

1-36 

•913 

-•043 

.023 

5 

•96 

2-98 

2-04 

•979 

-•019 

•018 

6 

1-00 

3-98 

2-73 

•997 

•003 

•003 

Sum    -330 

X2  =  100x-330,  n'  =  10,  P  =  -00031. 

Compare  with  this  the  following  case.  The  data  are  for  the  spatial 
threshold  on  the  forearm*,  and  were  gathered  by  the  Method  of  Non- 
Consecutive  Groups.  Sittings  containing  more  than  a  certain  number  of 
catch  errors  were  rejected,  the  number  chosen  being  one  which  when 
exceeded  at  all  was  usually  exceeded  violently.  Other  experimental 
precautions  were  taken  which  are  described  in  the  articles  cited.  As  a 
result  it  is  found  that  a  roughly  fitted  Normal  Curve  is  quite  a  fair  fit, 
for  in  13  out  of  100  cases  a  worse  departure  would  be  got  by  chance: 
and  a  skew  curve  would  improve  this.  The  data  are  in  fact  reasonably 
homogeneous.  (Calculations  on  opposite  page.) 

The  conclusion  of  the  whole  matter  is  that  we  are  led  to  believe  that 
the  difficulties  of  psychophysical  experiment  are  such  that  homogeneity 
in  the  data  is  rare.  For  such  data  refinements  of  mathematical  calcula- 
tion are  out  of  place.  The  curve  fitting  methods  here  described  are 
however  of  value  in  discovering  the  heterogeneity f. 

With  increasing  precautions  in  carrying  out  the  experiments  and 
with  increasing  practice  on  the  part  of  the  subject,  it  would  appear  that 
the  data  finally  reach  a  distribution  where  they  are  fairly  well  fitted  by 
a  "Normal  Curve,  and  excellently  fitted  by  Pearsonian  Skew  Curves. 

*  G.  H.  Thomson  "A  Comparison  of  Psychophysical  Methods,"  Brit.  Journ.  Psychol. 
1912,  v.  pp.  203—241;  "Changes  in  the  Spatial  Threshold  at  a  Sitting,"  Brit.  J&urn. 
Psychol  1914,  VL  pp.  432 — 448  and  B.A.  Report,  1913,  pp.  681—683. 

t  Compare  Urban's  use  of  the  Lexian  Coefficient  of  Dispersion  (op.  cit.)  and  compare 
the  latter  with  Pearson's  more  significant  criteria  /9t,  £2»  and  the  /S^  diagram  (XXXV 
in  his  Tables). 
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Normal  Integral  fitted  to  Thomson's  Spatial  Threshold  Data 


Cms. 

Two-point 
answers 

Continued 
sum 

Cms.  from 
mean 

In  <r 
units 

Theoretical 
p'  from 
Sheppard 

Observed  p 

(p-p')*lp'qf 

0 

5-2 

5-2 

-2-44 

-2-29 

•Oil 

•035 

•05293 

i 

6 

11-2 

-1-94 

-1-82 

•034 

•040 

•00109 

1 

8 

19-2 

-1-44 

-1-35 

•089 

•053 

•01599 

1* 

21 

40-2 

-0-94 

-0-88 

•189 

•140 

•01565 

2 

56 

96-2 

-0-44 

-0-41 

•341 

•374 

•00485 

2* 

84 

180-2 

0-06 

0-06 

•524 

•560 

•00521 

3 

105 

285-2 

0-56 

0-52 

•698 

•700 

•00002 

3* 

125 

410-2 

1-06 

0-99 

•839 

•834 

•00018 

4 

141 

551-2 

1-56 

1-46 

•928 

•940 

•00216 

4* 

144 

695-2 

2-06 

1-93 

•973 

•960 

•00644 

5 

148 

843-2 

2-56 

2-40 

•992 

•987 

•00315 

(  5 

843-2 

3137-2 

Sum  =-10767 

150  J 

(30 

168-64 

627-44 

5-621 

20-915 

d 

S* 

d  in  cms.  is  2-81  from  an  origin  of  5J  downwards. 
Mean  =  5-25  -  2-81  =  2-44  cms. 
25  =41  -83 


v2  =  4-63 
Sheppard's  correction      0-08 


/*,=  4-55  =0-* 

<r  =  2-133 
or      1-067  cms. 
X2  =  150  x  -10767  =  16-15, 
n'  =  12,  P=0-13. 


PAET  II 
CORRELATION 

CHAPTER  V 

INTRODUCTION  TO  CORRELATION 

A  SOMEWHAT  detailed  account  of  the  mathematical  theory  of  correlation 
and  of  the  way  in  which  it  may  be  usefully  applied  to  psychological 
measurements  will  be  found  in  the  later  chapters  of  this  Part.  The  object 
of  the  following  introductory  pages  is  to  give  the  reader  a  general  pre- 
liminary view  of  the  method,  free  from  mathematical  complications, 
and  to  illustrate  it  by  means  of  a  simple  example. 

Correlation  may  be  briefly  denned  as  "tendency  towards  concomitant 
variation,"  and  a  so-called  correlation  coefficient  (or,  again,  correlation 
ratio)  is  simply  a  measure  of  such  tendency,  more  or  less  adequate 
according  to  the  circumstances  of  the  case.  J.  S.  Mill,  in  his  "System 
of  Logic,"  distinguished  a  special  scientific  "Method  of  Concomitant 
Variations,"  which  he  based  upon  the  following  principle: 

"Whatever  phenomenon  varies  in  any  manner  whenever  another 
phenomenon  varies  in  some  particular  manner,  is  either  a  cause  or  an 
effect  of  that  phenomenon,  or  is  connected  with  it  through  some  fact 
of  causation*." 

The  instances  of  this  principle  which  Mill  had  in  mind  were  mainly 
cases  of  approximately  "complete"  concomitance  of  variation,  such  as 
those  usually  met  with  in  the  domain  of  Physics.  In  such  cases,  the 
conditions  of  an  experiment  admit  of  a  high  degree  of  simplification, 
the  phenomenon,  or  series  of  phenomena,  under  investigation  can  be 
isolated  with  tolerably  complete  success,  and  the  "  irrelevant "  factors  can 
be  reduced  to  a  minimum.  Under  such  conditions,  when  the  degree  of 
concomitance  of  the  different  corresponding  measures  of  the  two  pheno- 
mena is  found  to  be  very  high,  the  slight  deviations  from  complete 

*  Logic,  Bk.  m.  Ch.  viii.  §  6. 
B.  &  T.  7 
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correspondence  are  put  down  to  "errors  of  observation"  or  other  un- 
avoidable imperfections  in  the  experimental  method  employed. 

If  the  correspondence  is  one  of  simple  proportionality,  so  that  the 
graphical  representation  of  it  (one  phenomenon  being  measured  along 
the  axis  of  x,  the  other  along  the  axis  of  y)  is  a  straight  line*,  the  corre- 
lation coefficient  r  will  be  unity.  Example:  the  variation  of  the  length 
of  a  metal  rod  with  temperature. 

If  the  correspondence,  although  still  approximately  complete,  is  not 
one  of  simple  proportionality,  the  graphical  representation  of  it  will  be, 
not  a  straight  line,  but  a  curve  of  greater  or  less  complexity  f,  and  the 
correlation,  also  complete,  will  be  measured  not  by  the  correlation 


k 

•\- 
*\* 


X. 


•'•x. 

**: 


fc 

:^b, 
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coefficient  r,  but  by  the  correlation  ratio  77,  which  in  this  case  will  be 
unity.  Example:  the  variation  of  the  volume  of  a  certain  quantity  of 
gas  with  the  pressure  to  which  it  is  subjected,  the  temperature  remaining 
constant.  A  number  of  pairs  of  values  Pl9  Fx;  P2,  F2;  P3,  F3,  etc.  is 
obtained,  and  when  plotted  they  are  found  to  give  a  "scatter  diagram" 
of  the  approximate  form  of  Fig.  15. 

In  this  figure,  the  dots  represent  the  individual  pairs  of  observations 
P,  F.  They  cluster  very  closely  about  the  hyperbola,  PV  =  k,  repre- 
sented by  the  broken  curve.  The  curve  is  assumed  to  represent  the 
"real"  or  "true"  relation  of  the  two  "variates"  (as  we  call  such 
quantities  as  P,  F),  and  the  slight  deviations  of  the  observed  values 


*  Hence  the  correlation  is  said  to  be  "linear." 
f  Correlation  said  to  be  "non-linear"  or  "skew. 


CH.  v]  INTRODUCTION  TO  CORRELATION  99 

from  this  curve  are  explained  as  due  to  errors  of  observation  and  to 
other  factors  irrelevant  to  the  relation  under  investigation.  However 
this  may  be,  the  interesting  point  about  the  figure  so  far  as  our  present 
purpose  of  explaining  correlation  is  concerned  is  that  any  definite 
observed  P- value  is  "correlated"  with  a  plurality  or  "array"  of  observed 
F- values,  and  that,  similarly,  any  definite  observed  F- value  is  correlated 
with  a  plurality  of  P-values.  These  arrays  of  observed  values  cluster 
extremely  closely  about  their  means  (situated  on  the  curve),  i.e.  their 
"scatter"  or  "variability,"  as  measured  by  their  standard  deviations  (o-), 
is  extremely  small. 

The  modern  theory  of  correlation  is  directed  towards  the  manipula- 
tion of  observations  made  upon  phenomena  of  a  much  greater  degree 
of  variability  than  that  found  in  the  case  of  isolated  physical  phenomena. 
The  increased  variability  is  no  doubt  due,  in  the  main,  to  the  complexity 
of  factors  involved.  The  elementary  factors  do  not  admit  of  isolation, 
and  with  reference  to  the  concomitance  of  variation  of  the  two  series 
of  phenomena  under  consideration  they,  as  it  were,  pull  in  different 
directions.  The  correlation  coefficient  and  correlation  ratio  measure,  in 
these  cases,  the  average  extent  of  the  concomitance.  As  will  be  explained 
more  fully  in  the  next  chapter,  r  can  only  be  taken  as  a  measure  of  corre- 
lation when  the  average  relation  between  the  two  variates  is  linear,  and 
in  this  case  its  value  is  identical  with  that  of  77.  When  the  relation  is 
non-linear  r  is  practically  meaningless,  but  77  still  measures  the  relation 
accurately. 

The  general  problem  will  become  clearer  by  reference  to  the  ac- 
companying figure. 

Let  us  assume  that  we  have  a  group  of  200  school-children  and  have 
measured  each  of  them  for  mechanical  memory  (x)  and  for  general 
intelligence  (y).  Each  of  the  dots  in  the  figure  represents  a  child.  Then 
if  we  determine  the  mean  ^/-values  corresponding  to  each  successive 
"group"  of  x- values,  e.g.  x2  to  x3,  by  assuming  the  observations  con- 
centrated on  the  mid-ordinate  PM*,  the  line  AB"\  drawn  through  these 
mean  y- values  (marked  by  crosses  in  heavy  type)  represents  the  law  of 
change  of  mean  y- value  with  increase  of  x  and  gives  the  "most  probable" 

*  The  true  centroid  ordinate  is  slightly  nearer  the  denser  part  of  the  scatter  diagram, 
here  slightly  towards  the  right  of  PM.  The  correction  is  made  later  by  means  of  Sheppard's 
formulae  (see  above,  p.  84). 

f  The  means  have  been  placed  on  the  straight  line  for  the  sake  of  convenience  of 
exposition.  Actually,  they  will  occur  irregularly  on  either  side  of  it,  and  AB  will  be  the 
"best  fitting"  straight  line,  determined  by  an  application  of  the  Method  of  Least  Squares. 
See  next  chapter. 

7—2 
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value  of  y  for  any  particular  value  of  x.  If  the  line  is  straight  or  ap- 
proximately straight  the  "regression"  is  said  to  be  linear,  and  the 
equation  to  the  line  is 

y  -  y  =  r  -  (x  -  x), 

°i 

where  x,  y  are  the  mean  values  of  all  the  cc's  and  y's  respectively  (not 
the  means  of  the  arrays  just  mentioned),  al9  a2  are  the  standard 
deviations  of  the  x's  and  y's  respectively,  and  r  is  the  coefficient  of 
correlation. 


mech.  memory 


Fig.  16 


Thus  if 


20  is  the  mean  value  of  the  mechanical  memory  of  the  group, 
30  „     „  „          „        general  intelligence  of  the  group, 

4  „     „  standard  deviation  for  mechanical  memory  (o^), 
7,,     „  „          „        „     general  intelligence  (or2), 

and,  finally,  -6  is  the  value  of  r\  then  the  most  probable  measure  of  the 
general  intelligence  of  a  child  whose  mechanical  memory  is  represented 
by,  say,  the  value  14,  is  given  by  the  equation 

y  -  30  =  -6  x  \  (14  -  20), 
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whence  y  =  23-7.    This  value  is  the  average  of  an  array  of  possible 
values,  whose  standard  deviation 


=  5-6. 

It  will  be  proved  in  the  next  chapter  that 


where  x  and  y  are  deviations  from  the  mean  (not  absolute  values  as 
assumed  above),  and  S  (  )  indicates  summation,  i.e. 

S  (xy)  =  X&!  +  x2y2  +  xsy3  +  ...  +  xNyN, 
where  N  is  the  total  number  of  cases  (children  measured). 

It  is  important  to  note  that  by  starting  from  y  instead  of  from  x} 
and  determining  the  means  of  the  ^/-arrays  (such  as  the  array  within 
the  limits  y2y^  2/32/3'  )>  another  regression  line,  CD,  is  obtained  different 
from  the  first.  Its  equation  is 

x  -  x  =  r  ^  (y  -  y), 

°2 

and  it  represents  the  law  of  change  of  mean  x-  value  with  increase  of  y. 
It  gives  the  "most  probable"  value  of  x  for  any  particular  value  of  y. 

If  the  series  of  means  do  not  lie  on  a  straight  line  (approx.)  but  on 
a  curve  of  greater  or  less  complexity,  the  above  calculation  is  meaning- 
less. In  such  a  case,  called  a  case  of  skew  correlation  and  non-linear 
regression,  the  only  measure  of  the  correlation  of  the  two  variates  is  that 
given  by  77,  the  correlation  ratio,  77  is  the  ratio  of  the  standard  deviation 
of  the  means  of  the  arrays  (S)  to  the  total  standard  deviation  (of  either 
the  re's  or  the  ?/'s).  Thus  there  are  two  values  of  77,  one  for  the  cc's,  and 
another  for  the  y's.  They  approximate  closely  to  one  another,  as  a  rule, 
so  that  only  one  need  be  calculated. 

Si          S2 

*n  =  ~>  or  ~f- 

°1  °2 

When  the  regression  is  linear,  77  =  r;  otherwise  rj  >  r.  r  ranges 
between  the  values  ±  1,  77  between  0  and  1.  77  is  always  positive. 

It  will  now  have  become  clear  that  the  correlation  ratio,  77  (always) 

*  First  suggested  by  Bravais;  shown  to  be  the  best  measure  by  Professor  Karl 
Pearson,  who  gave  it  the  name  of  the  "product-moment"  formula.  A.  Bravais,  "Analyse 
mathematique  sur  les  probabilites  des  erreurs  de  situation  d'un  point,"  Acad.  des  Sciences, 
Memoir  es  presenter  par  divers  savants,  IP  Serie,  ix.  1846,  p.  255.  Karl  Pearson,  F.R.S., 
"Regression,  Heredity  and  Panmixia,"  Phil  Trans.  Boy.  Soc.,  1896,  CLXXXVII.  A, 
pp.  253  ff. 


•. 


102  CORRELATION  [PT.  n 

and  the  correlation  coefficient,  r  (when  regression  is  linear)  are  measures 
of  the  tendency  towards  concomitant  variation  exhibited  by  two  series 
of  phenomena,  and  hence  throw  some  light  upon  the  causal  relations 
of  these  phenomena.  Exactly  what  kind  of  causal  relation  we  are  justified 
in  inferring  from  them  will  become  clearer  in  the  course  of  the  next  few 
chapters. 

We  may  illustrate  the  significance  of  the  idea  of  correlation  in  a 
slightly  different  (and  more  elementary)  way.  Let  us  suppose  that  the 
200  children  have  been  arranged  in  order  of  merit,  as  regards  mechanical 
memory,  on  the  one  hand,  and  as  regards  general  intelligence  on  the 
other.  If  now  it  were  found  that  each  child's  order  was  the  same  in 
both,  i.e.  that  the  child  first  in  mechanical  memory  was  first  in  general 
intelligence,  the  child  second  in  mechanical  memory  was  second  in  general 
intelligence,  and  so  on,  correspondence  between  the  two  series  would  be 
complete  and  r  would  equal  +  1.  Or  if,  on  a  second  supposition,  the  child 
first  in  the  one  was  last  in  the  other,  the  child  second  in  the  one  was 
next  to  last  in  the  other,  and  so  on,  the  correspondence  between  the  two 
series  would  again  be  complete,  but  inverse,  and  r  would  be  —  1.  Finally, 
if  there  is  no  correspondence  whatever  between  the  two  series,  r  will 
be  zero.  A  value  of  r  between  0  and  +  1  will  express  a  tendency,  greater 
or  less  according  to  r's  size,  for  children  above  the  average  or  mean 
position  in  the  one  ability  to  be  above  the  mean  position  in  the  other, 
and  for  children  below  the  mean  position  in  the  one  to  be  below  the 
mean  position  in  the  other.  A  value  of  r  between  0  and  —  1  will  express 
a  tendency,  greater  or  less  according  as  r  is  numerically  greater  or  less, 
for  the  children  above  the  mean  position  in  the  one  ability  to  be  below 
the  mean  position  in  the  other,  and  conversely.  Now  if  order  or  "rank" 
be  taken  as  an  inverse  measure  of  ability,  the  value  of 


becomes  6S 


N  (N2  -  1) ' 

where  d  is  the  difference  between  the  rank  of  an  individual  in  the  one 
series  and  his  rank  in  the  other.  This  form  gives  us  a  general  impression 
of  its  appropriateness  for  the  purpose  in  view,  since  the  greater  the 
disparity  between  the  two  series  of  ranks  the  greater  is  S  (d2)  and  hence 
the  smaller  is  r.  If  there  is  no  relation  at  all  between  the  two  series, 
S  (d2)  acquires  the  value  it  would  have  according  to  pure  chance,  and 
this  can  be  shown  to  be  N  (N2  —  l)/6,  which  makes  the  whole  expression 
zero,  as  it  should  do. 
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The  one  objection  to  the  formula  is  that  it  assumes  the  difference 
between  any  two  neighbouring  ranks  to  be  equal  at  all  parts  of  the 
scale.  This  is  obviously  a  false  assumption;  the  distance  of  individual 
from  individual  at  the  two  extreme  ends  of  the  scale  must  be  con- 
siderably greater  than  that  between  individuals  near  the  middle.  A 
correction  for  this,  based  on  the  assumption  that  the  form  of  distribution 
of  the  abilities  in  each  of  the  cases  is  Normal,  has  been  calculated  by 
Professor  Pearson.  It  is 


,  .          6S  (d?) 

where  p  =  1  — 


N  (N2  -  1) ' 

At  the  end  of  this  chapter  is  given  a  table  whereby  p- values  may  at 
once  be  converted  into  corresponding  r-values,  according  to  the  above 
equation. 

Finally,  there  is  the  question  of  the  "probable  error"  (P.E.).  Like 
every  other  constant  calculated  from  a  limited  sample  of  variable 
material,  the  coefficient  of  correlation  varies  in  value  from  sample  to 
sample,  and  a  measure  is  needed  of  the  limits  within  which  it  may  be 
expected  with  a  fair  degree  of  probability  to  lie.  This  measure  is  given 
by  the  probable  error.  In  the  case  of  r  determined  by  the  product- 
moment  formula,  when  N  is  sufficiently  large, 

•67449  „ 


which  means  that  it  is  an  even  chance  that  the  true  value  of  r  lies 
between  the  limits 

•67449  (1  -  r2) 

The  chances  are  16  to  1  against  the  value  falling  outside  the  limits 
r  ±  3  P.E. 

For  r  determined  by  the  rank  formula,  the  probable  error  is  slightly 
larger,  being  -7063  (1  -  r2)/i/N. 

If  N,  the  number  of  cases,  be  small  (say,  less  than  30),  the  probable 
error  is  larger.  Its  exact  size  under  such  conditions  is  not  known. 

The  following  is  an  example  of  the  way  in  which  a  correlation 
coefficient  may  be  obtained  by  means  of  ranks.  The  subjects  were  boys 
in  the  Fourth  Form  of  a  Public  School,  and  the  correlation  to  be  obtained 
is  that  between  ability  in  Classics  and  ability  in  Drawing. 
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Form  Order 

Classics          Drawing  dz 

9  (1~9)2=  64 

2  0 

16  49 

6  1 
15  144 

4  1 

17  121 

3  1 

5  9 
14  9 

12  4 

7  49 
1  121 

8  25 

10  25 

18  4 

11  36 

13  25 

19  _0 
688=S(d2) 

-  i          w  v™  /       _  i  _    6  x  688  _     A* 
N  (N*  _  i)          "  19  x  360 

.'.    r  =  2  sin  fcp\  =  416, 
7063  (1  -  r2) 


R.  C.  0. 

1 

H.  G.  M. 

2 

B.L. 

9 

F.  L.  S. 

7 

C.  M.  S. 

3 

C.  J.  L.  H. 

5 

A.  L.  P. 

6 

E.  G.  T. 

4 

F.  C.  F. 

8 

N.  P  R.  N. 

11 

H.  B.  D. 

10 

S.  H.  T. 

14 

H.  B.  M. 

12 

L.  H.  S. 

13 

J.  P.  C. 

15 

E.W. 

16 

C.  C.  M. 

17 

L.  H.  W. 

18 

E.  M.  J. 

19 

P.E. 


VN 


r  is  here  just  over  three  times  its  probable  error,  and  we  might 
therefore  feel  inclined  to  conclude  that  it  proves  a  real  correlation 
between  the  two  series.  We  must  remember,  however,  that  19  is  a  very 
small  number  of  cases,  and  that  therefore  the  real  probable  error  is 
considerably  larger  than  that  given  by  the  formula.  Hence  the  reality 
of  the  correlation  is  not  so  certain.  Our  caution  is  proved  to  be  justified 
when  we  turn  to  the  next  higher  form,  the  Remove,  and  find  that,  with 
the  same  number  of  boys,  the  correlation  between  ability  for  Classics  and 
Drawing  ability  works  out  as  —  -313  (±  -14),  quite  a  different  result.  It 
might  be  objected  that  other  factors  than  mere  smallness  in  the  number 
of  cases  were  responsible  for  the  difference;  e.g.  that  the  tendency  to 
specialise  in  Classics  was  greater  in  the  Remove  than  in  the  Fourth,  and 
that  the  consequent  neglect  of  Drawing  by  the  abler  boys  lowered  the 
correlation.  To  this  it  may  be  replied,  firstly,  that  the  drawing-master 
was  the  same  for  both  forms,  and  was  likely  to  get  as  much  out  of  the 
boys  as  possible  in  each  case,  and,  secondly,  that  the  difference  between 
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the  two  forms  in  respect  of  the  degree  of  specialising  tendency  was  in- 
sufficient to  account  for  the  disparity  of  the  results. 

The  correct  way  to  compare  the  results  mathematically  is  to  deter- 
mine the  probable  error  of  their  difference.  This  =  the  square  root  of 
the  sum  of  the  squares  of  the  probable  errors  of  each*,  i.e. 


which,  in  this  case,  =  V-132  +  -142 

=  •19. 

The  difference  =  -416  +  -313  =  -73,  nearly  four  times  the  size  of  its 
probable  error  =  -19. 

A  very  important  extension  of  the  theory  of  correlation  is  the  con- 
ception of  "partial"  correlation.  If,  e.g.,  three  mental  abilities  are 
correlated  with  one  another,  it  is  of  interest  to  know  how  closely  any 
two  of  them  are  correlated  with  one  another  for  a  constant  value  of  the 
third.  Such  a  coefficient  is  written,  in  Yule's  notation,  r12.3. 

This  may  be  illustrated  from  our  example  by  taking  the  form  order 
for  English  into  consideration  in  addition  to  that  for  Classics  and  that 
for  Drawing.  The  correlation  between  Classics  and  English  works  out 
as  '78,  that  between  Drawing  and  English,  as  -21. 

Then  the  correlation  between  Classics  and  Drawing  for  "English 
constant"  is 

rCD  — 


rCD.E  '• 

v  i  •»•  —  rcE~)  v  v1  —  'res'; 

.4.9  _  .7ft  v   .91 

•42. 


-rCE2)V(l- 
•42  -  -78  x  -21 


V(1--782)V(1-'212) 

.Thus  in  this  particular  case,  the  "partial"  coefficient  is  practically 
identical  with  the  "entire"  coefficient. 

If  therefore  boys  were  selected,  out  of  a  population  of  which 
the  actual  form  is  a  random  sample,  so  as  to  be  all  equal  in  their 
"English"  ability,  the  correlation  between  their  "Drawing"  ability  and 
their  "Classics"  ability  would  be  unaffected.  Of  course  such  a  set  of 
boys,  in  addition  to  being  all  alike  in  English,  would  be  less  scattered 
in  both  Classics  and  Drawing  (especially  in  the  former)  than  are  the 
boys  of  the  actual  form,  and  their  average  ability  in  these  subjects 
would  be  higher  or  lower  than  that  of  the  actual  form  according  to  the 
level  of  ability  in  English  at  which  they  had  been  selected. 

On  the  other  hand  the  partial  correlation  of  English  and  Drawing 

*  See  p.  24. 
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for  "  constant  Classics"  will  be  found  to  be  —  -2,  so  that  selection  for 
Classics  creates  a  negative  correlation  between  English  and  Drawing,  in 
so  far  as  we  can  judge  from  this  particular  case. 

The  reader  must  be  warned  against  the  temptation  to  draw  deduc- 
tions as  to  the  "  common  factors"  uniting  any  of  these  pairs  of  subjects. 
The  fallacy  underlying  such  reasoning  is  discussed  in  pp.  139  —  145. 

The  principle  of  partial  correlation  can  be  extended  to  include  an 
indefinite  number  of  variables,  and  general  formulae  for  this  purpose 
will  be  given  in  Chapter  VII. 

It  is  obvious  that  when  the  subjects  in  the  group  examined  are  not 
all  alike  in  respect  of  some  irrelevant  factor  such  as  age,  these  same 
formulae  can  be  employed  to  ascertain  what  the  correlations  would  be 
in  a  group  which  was  homogeneous  with  regard  to  the  factor  in  question. 
Great  care  has  to  be  taken  in  interpreting  the  results  of  such  calculations 
however,  as  fundamental  assumptions  may  not  be  satisfied.  This  will 
become  clearer  in  the  following  chapters. 


Table  for  converting  p  into  r  (r  =  2  sin 


p             r  p  r  p             r  p              r 

•05         -052  -30  -313  -55  -568  -80           -813 

•10         -105  -36  -364  -60  -618  -85           -861 

•15         -157  -40  -416  -65  -668  -90           -908 

•20         -209  -46  -467  -70  -717  -95           -954 

•25         -261  -50  -518  -75  -765  1-00  1-000 

*  Quoted  from  K.  Pearson,  F.R.S.,  Drapers'  Company  Research  Memoirs,  Biometric 
Series,  iv.  1907,  p.  18. 


CHAPTER  VI 

THE  MATHEMATICAL  THEORY  OF  CORRELATION 

Correlation  coefficient  r— Correlation  ratio  77 — Probable  errors — The  normal  correla- 
tion surface  and  its  properties — Other  methods  of  determining  correlation — Fourfold 
table — Method  of  contingency — Two-row  table — Short  methods — The  method  of 
ranks — Spearman's  foot-rule — Correlation  of  sums  or  differences. 

IN  the  present  chapter  an  attempt  will  be  made  to  summarise  briefly 
the  principal  methods  in  use  for  obtaining  a  measure  of  the  correlation, 
or  tendency  towards  concomitant  variation,  of  two  or  more  variates. 

Let  the  coordinates  of  the  dots  in  the  accompanying  diagram — 
commonly  known  as  a  "scatter  diagram" — represent  the  measures  of 
two  separate  characteristics,  e.g.  speed  of  adding  figures  (x)  and  accuracy 
of  adding  figures  («/),  in  a  number  of  individuals  (N). 


speed  of  adding  (a;) 
Fig.  17 

Let  the  crosses  represent  the  mean  values  of  y  corresponding  to 
values  of  x  lying  between  the  limits  of  pairs  of  successive  units  of  measure- 
ment. Then  the  broken  curve  CO'  passing  through  these  crosses  repre- 
sents the  most  probable  law  of  relationship  between  speed  of  adding 
and  accuracy  of  adding,  and  is  known  as  the  regression  curve.  (In 
practice  the  crosses  do  not  lie  so  accurately  on  the  curve.) 

(1)  CORRELATION  COEFFICIENT  (r) 

What  we  chiefly  want  to  know,  however,  even  when  the  regression, 
as  here,  is  not  linear,  is  (1)  whether  large  x  is  on  the  whole  associated 
with  large  y,  etc.,  (2)  how  to  find  roughly  the  mean  y  associated  with 
given  x.  To  do  this,  we  find  the  "best  fitting"  straight  line.  LL',  to  the 


108  CORRELATION  [PT.  n 

swarm  of  dots  in  the  figure,  using,  merely  from  motives  of  convenience, 
the  Method  of  Least  Squares*. 
Let  the  equation  of  LL'  be 

Y  =  b2lX  +  c. 

Then  applying  Least  Squares  will  make  S  (y  —  Y)2  a  minimum,  where 
y  is  the  ordinate  of  any  dot,  and  Y  the  ordinate  of  the  line  at  the  same 
abscissa,  which  is  both  X  and  x.  There  will  be  as  many  equations 

(y  -  7)  =  y  -  (b2lx  +  c)  =  v 

as  there  are  dots,  and  the  v's  correspond  to  the  "residuals"  of  the 
Method  of  Least  Squares,  though  here  they  are  real  deviations  and  not 
errors.  The  Normal  Equations  for  b21  and  c,  formed  according  to  the 
rule  on  p.  45,  give  at  once 

S(xy)-b2lS(x*)-cS(x)  =  0, 
S(y)    -&2iS(z)  -c£(l)  =  0. 

If  there  are  2V  points,  S  (1)  —  N',  and  if  the  sc's  and  ?/'s  are  measured 
from  the  mean  of  the  whole,  S(x)  =  S  (y)  =•  0.  The  equations  then  become 

S  (xy)  =  b21  8  (x2), 

Nc  =  0. 

Thus  c  is  zero,  that  is  the  line  LL'  goes  through  the  mean.  And  b2V 
which  is  the  tangent  of  the  slope  of  LL'  ,  is 

_8(xy)_8(xy) 
°21  "  S  (x2)  ==  1^7  ' 

The  line  LL'  is  known  as  the  regression  line,  and  b2l  as  the  coefficient 
of  regression  of  y  on  x.  If  we  define  r  as 

_S(xy) 


then  621  =  r 

and  the  equation  to  LL'  is 


x  and  y  being  measured  from  their  mean  values. 
An  analogous  equation 


gives  the  regression  of  x  on  y.    There  are  thus  two  regression  lines.   If 

*  G.  Udny  Yule,  "On  the  Significance  of  Bravaia'  Formulae  for  Skew  Correlation," 
Proc.  Roy.  Soc.  1896,  LX.  pp.  477  —  489. 
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x  and  y,  in  addition  to  being  measured  from  their  means,  are  also 
measured  in  terms  of  their  standard  deviations  as  unity,  the  regression 
equations  become 

X  =  rY  and    Y  =  rX, 

and  r  is  then  itself  the  coefficient  of  regression  of  y  on  x  and  of  x  on  y, 
the  two  regressions  being  equal. 

Since  y  —  Y  measures  the  distance  in  the  y  direction  of  any  point 
from  the  regression  line,  the  quantity 


N 

gives  the  mean  square  deviation,  in  the  y  direction,  of  all  the  points 
from  the  regression  line, 

S(y-bnx)*     S(y*)  S(xy)  S(a?) 


(1  -  r*)! 


Hence  the  standard  error  or  deviation  made  in  estimating,  by  means 
of  the  regression  equation,  the  value  of  y  most  probably  associated  with 
any  particular  x,  is,  on  the  average,  given  by 

o-2  x  V(l  -V2), 

and  if  the  distribution  is  Normal  it  has  this  value  not  only  on  the  average 
but  for  each  array. 

r  is  known  as  the  coefficient  of  correlation,  and  evidently  must  lie 
between  the  values  +  1  and  —  1.  If  the  regression  line  coincides  with 
the  regression  curve,  within  thejjjnits  of  errors  of  random  sampling,  — 
in  other  words,  if  the  regression  is  linear  —  r  is  a  measure  of  the  degree 
of  dependence  between  x  and  y.  When  r  =  ±  1,  the  points  close  up  upon 
the  line  and  the  "scatter  diagram"  contracts  to  become  the  line  itself. 

The  formula  r  = 


is  implied  in  Bravais'  work  of  1846,  and  was  shown  by  Professor  Karl 
Pearson  in  1896  to  be  the  best  measure  of  r.  Hence  it  is  known  as  the 
Bravais-Pearson  Product-Moment  Formula.  It  may  be  written 


_ 
" 


the  denominator  being  the  geometrical  mean  of  the  two  second  moments, 
and  the  numerator  the  product-moment,  of  x  and  y. 
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If  x  and  y  are  not  measured  from  their  means,  but  from  some  con- 
venient point  distant  d^  from  the  x  mean  and  d2  from  the  y  mean,  the 
arithmetic  is  very  considerably  lightened,  and  the  formula  becomes,  as 
may  be  tested  by  simple  algebra, 

8  (xy)  - 


r  = 


V{S  (z2)  -  Nd*}  V(S  (y*)  -  Nd/}  • 
The  following  example  is  intended  to  show  one  form  of  calculation 
based  on  this  formula.  Being  only  a  model  for  this  purpose,  it  is  kept 
short  so  that  the  arithmetic  can  be  easily  followed.  But  it  must  be  made 
quite  clear  that  really  to  calculate  correlations  with  only  ten  cases  is  absurd, 
for  the  probable  errors  are  enormous  and  moreover  are  unknown  (see 
p.  114).  A  calculation  with  larger  numbers,  and  made  by  a  slightly 
different  process,  is  given  on  p.  115  et  seq. 

In  the  following  table  A  and  B  are  the  percentage  errors  made  by 
certain  cadets  in  a  test  in  judging  distance,  in  the  years  1915  and  1916 
respectively. 


Name 

A 

B 

x  =  (A-lQ) 

y=(*-13) 

x* 

ya 

xy 

Cadet  A 

15 

15 

-1 

2 

1 

4 

-  2 

„  B 

19 

10 

3 

-3 

9 

9 

-  9 

„  c 

11 

14 

-5 

1 

25 

1 

-  5 

„  D 

17 

16 

1 

3 

1 

9 

3 

„  E 

8 

18 

-8 

5 

64 

25 

-40 

„  P 

14 

12 

-2 

-1 

4 

1 

2 

»  G 

24 

14 

8 

1 

64 

1 

8 

„  H 

28 

11 

12 

-2 

144 

4 

-24 

„  K 

7 

8 

-9 

-5 

81 

25 

45 

»  L 

14 

14 

-2 

1 

4 

1 

-  2 

Means     15-7   13-2 

Sums     397    80    -24 

Provisional  16     13 

Subtract     0-9   0-4   -  0-6  (i.e.  Ndd) 

d1=-.3d2  =  -2 

396-1   79-6   -23-4 

23-4 


~ 


_ 


(2)  CORRELATION  RATIO  fa)* 

It  is  clear  that  if  the  regression  is  not  linear  r  ceases  to  be  a  satis- 
factory measure  of  the  relation  between  the  two  characters  under  con- 
sideration. In  an  extreme  case,  such  as  that  shown  in  the  accompanying 
diagram,  r  may  be  zero  while  there  is  yet  a  very  close  relation  between 
the  two  characters. 

Clearly,  if  the  individual  observations,  i.e.  the  dots  in  the  figure,  are 

*  See  Drapers'  Company  Research  Memoirs,  Biometric  Series,  n.  p.  9  et  seq.    Karl 
Pearson,  "On  the  Theory  of  Skew-Correlation  and  Non-Linear  Regression." 
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all  exactly  situated  on  the  regression  curve,  the  quantity  y  is  an  exact 
mathematical  function  of  x,  and  correlation  is  perfect,  or  rj  =  1,  where 
77  is  a  new  and  as  yet  undefined  measure  of  correlation,  called  the 
correlation  ratio;  while  if  the  individual  observations  are  much  scattered 
right  and  left  of  anyone  walking  along  the  regression  curve,  the  correla- 
tion is  imperfect,  and  17  <  1. 

If  there  is  no  scatter  at  all  in  any  array,  then  the  correlation  is 
perfect,  and  the  greater  the  scatter  the  less  the  correlation,  i.e.  the  less 
certain  is  any  prediction  of  y  from  x. 

Professor  Pearson  therefore  makes  the  correlation  ratio  rj  depend  on  the 
amount  of  scatter  in  the  arrays.  Exactly,  he  makes  it  depend  on  the  mean 
of  the  weighted  squares  of  the  standard  deviations  of  the  arrays,  i.e.  upon 

<,*  =  S(nxan*)IX. 


jsP^%. 


-*•  X 

Fig.  18 

The  correlation  ratio  rises  and  falls  as  this  quantity  falls  and  rises.  nx 
is  the  number  of  cases  in  the  array  of  which  crnx  is  the  standard  deviation. 
Clearly  the  unit  in  which  a^2  is  measured  for  this  purpose  must 
depend  on  the  standard  deviation  of  the  whole  of  the  y's.  If  the  arrays 
are  less  scattered  than  the  whole,  there  is  correlation.  If  there  is  no 
correlation,  any  array  will  have  just  as  much  scatter  as  the  whole  has. 
Pearson  writes 


. 

Compare  this  equation  with  that  found  above  for  the  case  of  linear 
regression,  p.  109,  namely 

"average"  mean  square  deviation  of  an  array  =  cr22  (1  —  r2), 
which  in  our  present  notation  is 
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and  from  the  comparison  we  see  that  Pearson  has  chosen  ??2  so  that  it 
becomes  r2  for  linear  regression.  The  correlation  ratio  77  however  never 
becomes  negative  but  is  in  linear  regression  numerically  equal  to  r. 

The  above  formulae  already  allow  of  the  calculation  of  77,  but  a 
simplification  is  possible.   By  its  definition, 


S'  being  a  summation  up  and  down  an  array,  so  that 


S  being  a  summation  at  right  angles  to  the  former,  i.e.  a  summation 
of  the  arrays.  That  is  to  say,  the  mean  of  the  weighted  squares  of  the 
standard  deviations  of  the  arrays  is  the  same  thing  as  the  mean  square  of 
the  distances  of  the  dots  (measured  in  the  y  direction)  from  the  regression 
curve.  This  simplifies  to 

NaJ  =  88'  (y*)  -  288'  (yxyx)  +  88'  (y*) 
=  N*2*-2S{yxS'(yx)}  +  S(nxyx*) 
=  Na*  -  2S  (yxnxyx)  +  S  (nxyx*) 
=  Na*  -  S  (nxyx*) 


= 

where  S2  is  the  standard  deviation  of  the  means  of  the  arrays,  each  array 
being  weighted  with  the  number  of  cases  in  it.   Therefore 


2  ,  2 

^    ~~  of"          ^ 


The  correlation  ratio,  therefore,  is  the  ratio  of  two  standard  deviations, 
one  of  the  means  of  the  arrays  (properly  weighted),  the  other  of  the  whole. 
Starting  from  the  other  variate,  we  arrive  in  a  similar  way  at  a 
second  value 


Since  r)  is  the  ratio  of  two  standard  deviations,  it  must  always  be  positive. 
Let  Y  be  the  ordinate  of  any  point  on  the  regression  line,  then  the 
average  of  the  sum  of  the  weighted  squares  of  the  distances  between 
the  regression  line  and  the  regression  curve 

S{nx(yx-Y)*} 

N 
which  reduces  to  cr22  (rf  —  r2). 
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Thus  r)  must  always  be  numerically  greater  than  r,  except  in  the 
case  of  linear  regression,  when  it  is  numerically  equal  to  r. 

In  examining  the  relationship  between  two  measurable  characters, 
f\  should  be  calculated  as  well  as  r,  since  it  serves  as  a  test  of  the  linearity 
or  non-linearity  of  the  regression,  and  is  also  a  better  measure  of  causal 
relation  than  r. 

A  simple  criterion  for  linearity  which  is  very  generally  applicable  is 
that 

VN 

•67449 ' 

For  very  exact  work,  more  complicated  formulae  need  to  be  em- 
ployed. 

The  results  obtained  above  are  all  independent  of  the  forms  of  dis- 
tribution of  the  variates. 

(3)  PROBABLE  ERRORS 

In  determining  means,  standard  deviations,  and  other  frequency 
constants,  the  investigator  is  unable  to  work  from  the  "total  population" 
and  must  be  content  with  the  results  obtained  from  "random  samples" 
of  greater  or  less  size  taken  from  this  (in  some  cases,  hypothetical)  total 
population. 

When  the  number  of  cases  (n)  in  the  random  sample  is  fairly  large — 
so  large  that  fractions  containing  certain  higher  powers  of  n  in  the 
denominator  can  be  neglected — the  probable  errors  are  found  to  be  as 
f  ollowsf : 

P.E.  of  a  mean  =  -67449  -^=, 

Vn 

„       „  <T= -67449-^, 

V2n 

I  —  r2 
r=- 67449 


Vn 

The  second  and  third  of  these  values  are  only  correct  when  the 
frequency-distribution  is  normal  or  approximately  normal.  In  parti- 
cular, for  large  values  of  r  the  true  P.E.  may  be  considerably  different 
from  that  given  by  the  above  formula  unless  the  distribution  is  normal. 

*l  2 

P.E.  of  77  =  -67449  — -J- ,  for  linear  regression,  and  also,  as  a  rough 

*  J.  Blakeman,  Biometrika,  TV.  pp.  349,  350. 

f  See  W.  Gibson,  "Tables  for  Facilitating  the  Computation  of  Probable  Errors," 
Biometrika,  iv.  p.  385  et  seq.  and  Pearson's  Tables. 

B.  &  T.  8 
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measure,  for  cases  of  skew  correlation.    If  greater  exactitude  is  needed 
in  the  latter  cases,  more  complicated  formulae  have  to  be  employed*. 
Another  frequency  constant  in  common  use  is    the  coefficient  of 

100(7 

variation  V,  which  =  -     —  . 
mean 

Its  P.E.  =  -67449  F  l  +  2    ? 


As  stated  above,  the  values  just  given  for  the  probable  errors  only 
apply  in  cases  where  n  is  fairly  large.  In  cases  where  n  is  so  small  that 
certain  higher  powers  of  its  reciprocal  cannot  be  neglected  in  comparison 
with  the  rest  of  the  expressions  involving  them,  the  values  cannot  be 
used.  For  such  cases  no  theoretical  formulae  have  hitherto  been  devised. 

An  empirical  investigation  has  however  been  madef  on  samples  of 
4,  8,  and  30  cases,  taken  from  a  "total  population"  of  3000  pairs  of 
measurements  (height  and  left  middle  finger  measurements  of  3000 
criminals;  "real"  correlation,  -66).  From  the  results  obtained  it  may 
be  concluded  that,  although  in  the  case  of  such  small  samples  as  4  or  8 
the  ordinary  formula  for  the  probable  error  of  r  gives  much  too  low  a 
value,  yet  in  the  case  of  as  many  as  30,  the  formula  applies  with  tolerable 
accuracy.  We  must,  however,  bear  in  mind  that  this  result  has  only 
been  proved  (empirically)  to  hold  in  the  single  case  when  the  actual 
correlation  was  -66. 

The  calculation  of  the  probable  errors  of  means,  standard  devia- 
tions, coefficients  of  variation,  and  coefficients  of  correlation  is  very 
much  facilitated  by  the  use  of  Pearson's  Tables  for  Statisticians,  es- 
pecially Tables  V,  VI,  VII  and  VIII,  calculated  by  members  of  the 
staff  §  of  the  Biometric  Laboratory,  University  College,  London. 

We  give  next  an  example  of  the  evaluation  of  r  and  of  77  between 
speed  of  adding  single  digits  and  accuracy  in  doing  so,  the  individuals 
measured  being  86  boys  between  the  ages  of  11  and  12  years  from  two 
L.C.C.  elementary  schools.  The  two  groups  could  be  thrown  together 
for  this  purpose,  since  the  means  and  standard  deviations  calculated 
from  them  separately  were  in  very  close  agreement  —  well  within  the 
limits  of  the  probable  errors. 

*  Karl  Pearson,  op.  cit.  (Biometric  Series,  n.)  p.  19. 

t  Calculated  values  for  different  values  of  n  given  in  Gibson's  Tables,  see  pp.  xxii 
and  18  of  Pearson's  Tables. 

J  "Student"  :  "The  Probable  Error  of  a  Coefficient  of  Correlation,"  Biometrika,l908^-  9, 
vi.  p.  302. 

§  Miss  Winifred  Gibson,  Dr  Raymond  Pearl,  T.  Blakeman,  Dr  David  Heron,  Miss 
H.  Gertrude  Jones,  H.  E.  Soper. 
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Frequency 

x' 

Frequency 
xz' 

Frequency 
xx'2 

Frequency 

y' 

Frequency 
xt,' 

Frequency 
x^'2 

3-5 

-3 

10-5 

31-5 

6 

-5-5 

33 

181-5 

18-5 

-2 

37 

74 

4 

-3 

12 

36 

14-5 

-1 

14-5 

14-5 

3 

-2 

6 

12 

18 

0 

-62 

— 

9 

-1 

9 

9 

15-5 

1 

15-5 

15-5 

18 

0 

-60 

— 

10-5 

2 

21 

42 

25 

1 

25 

25 

2-6 

3 

7-5 

22-5 

21 

2 

42 

84 

3 

4 

12 

48 

S6=N 

+  67 

347-5 

86  =# 

+  56 

248 

+  7 

-6 

^=-^= -0-06977, 


2  = 


248 


86 

=  2-7955, 
=  1-67. 


JL! 

12 


=  86  =  0-0814, 

_  347-5      j  24. 

86 

=  4-0341, 
=  2-013. 


Frequencies 

*vt 

Total 
frequency  / 

/x*y 

1+6-5  -4-5  -1-5 

1 

0-5 

+  0-5 

2-5+0-5  +2-5  +  3  -4-5  -5  -2 

2 

-3 

-6 

1+0-5+0-5-1 

3 

1 

3 

2+2-5+0-5-2-5 

4 

2-5 

10 

0-5-1 

5-5 

-0-5 

-2-75 

1+2-1 

6 

2 

12 

0-5 

8 

0-5 

4 

1 

9 

1 

9 

3 

11 

3 

33 

-1 

22 

-1 

-22 

71-5  -30-75 

S(x'y')=  40-75 

*  Sheppard's  correction.   Remember  that  (72=ju2,  and  ^="1';  8ee  above,  p.  84. 
f  Sheppard's  correction  cannot  be  used  here,  since  the  units  of  the  subgroups  are 
not  equal  and  there  is  not  high  contact  at  the  ends  of  the  frequencies. 
%  The  figures  in  italics  in  the  correlation  table. 
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S  (xy)  =  S  (x'y')  -  Nd^d2 

=  40-75  +  86  x  -0057 
=  41-24, 

41-24 


Naja2      86  x  1-67  x  2-013 
P.E.  =  -67449  .  i-Tr  =  0-071. 


.*.    ^speed  of  addition  =  0'14  ±  0'07. 

aco.  of  addition 


yx  -y 

(yx-y?*nx 

-0-224 

•050176 

x  3-5  = 

•175616 

-  1-081 

1-168561 

x  18-5  =21-618379 

0-488 

•238144 

x!4-5  = 

3-453088 

0-322 

•103684 

x!8  = 

1-866312 

0-145 

•021025 

x!5-5  = 

•325888 

0-490 

•2401 

x!0-5  = 

2-52105 

1-719 

2-954961 

x  2-5  = 

7-387403 

-1-414 

1-999396 

x  3  = 
-y)}*=4 

5-998188 

fo   •j^     (lly   $7)    > 

43-345924 
-86x4-034 
/.   77  =  0-353, 
P.E.  =  -67449  (1  - 
=  0-064. 

•'.    ^speed  of  addition  =  0'35  ±  0*06. 
ace.  of  addition 

Calculating  t\  from  the  means  of  the  ^-arrays,  we  have 

2      S{ny(xy-x)*} 
11  "  NaJ 

19-68101 


86  x  2-796 
=  -0819. 
/.  7?  =  0-29  ±  0-07. 

To  test  the  value  of  rj  obtained  from  the  means  of  the  a?-arrays,  for 
linear  regression: 

VN       lA/-2rr-ySJ:_        -323 
•67449  '      ^  2  x  -07273 

=  2-22,  i.e.   <  2-5. 

Hence  regression  may  be  considered  to  be  linear. 

*  See  p.  113. 
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Regression  coefficients: 


=  r^  =  -118, 


Equations  to  regression  lines  are 

x  -  x  =  bl2  (y  -  y), 


and 


i.e. 


equation  to  regression  line  AB  is 

y  -  164-2  =  -172  (x  -  237-7), 
y  =  -172z  +  123-316 


237-7  (mean  speed) 


(i). 


140          180         220          260         800          340          380 
speed  (no.  of  figures  added  in  10  minutes) 

Fig.  19 

Similarly  the  equation  to  the  line  CD  is 

x  -  237-7  =  -118  (y  -  164-2), 
i.e.  x  =  -Il8y  -  218-324  (ii). 

Equation  (i)  gives  the  most  probable  value  of  y  associated  with  a 
given  value  of  #,  with  a  standard  error 

a2V(l-r2),  i.e.  1-991. 

Similarly,  mutatis  mutandis,  with  equation  (ii). 

As  a  model  of  the  proper  use  of  the  correlation  coefficient  and  ratio 
we  may  cite  a  very  detailed  investigation  into  the  relationship  of  in- 
telligence to  the  size  and  shape  of  the  head,  and  to  other  physical  and 
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mental  characters,  by  Prof.  Karl  Pearson*,  which  appeared  in  Bio- 
metrika,  v.  1906 — 1907.  The  subjects  measured  were  1000  Cambridge 
undergraduates  and  considerably  more  than  5000  school-children.  Special 
care  was  taken  in  drawing  up  a  quantitative  scale  of  intelligence,  ad- 
justments being  made  so  that  the  results  fitted  a  "normal"  or  Gaussian 
distribution.  The  correlations  were  worked  out  in  several  different  ways, 
— correlation  coefficient  (r),  correlation  ratio  (17),  coefficient  of  mean 
square  contingency!,  and  the  method,  first  suggested  in  this  paper,  of 
the  analograph.  Pearson  describes  this  last  method  as  follows:  "In  the 
case  of  intelligence,  I  take  a  normal  scale  as  my  base  line  and  plot  up 
the  percentage  of  the  character  for  each  grade  of  intelligence  along  the 
centroid  vertical  of  the  corresponding  range,  drawing  a  horizontal  line 
to  represent  the  mean  percentage  in  the  population  at  large.  We  thus 
obtain  a  diagram,  which  I  will  venture  to  call  an  analograph. 

"  If  the  percentage  increases  or  decreases  continually  with  intelligence 
(or  with  the  base  character,  whatever  it  may  be),  I  term  the  relationship 
homoclinal;  if  the  percentage  does  not  reach  its  maximum  with  the 
maximum  or  minimum  of  intelligence,  I  term  the  diagram  heteroclinal" 

(4)  THE  NORMAL  CORRELATION  SURFACE  AND  ITS  PROPERTIES 

We  have  so  far  considered  the  correlation  coefficient  r  from  the  point 
of  view  from  which  it  was  approached  by  Galton,  who  measured  it  by 
the  inclinations  6^  and  02  of  what  we  have  called  the  regression  lines, 
according  to  the  formulae 

r  =  ^  tan  0X  =  ^  tan  02  =  V(tan  dl  tan  02). 

(T2  (Tj 

We  followed  this  up  by  an  application  of  the  Method  of  Least  Squares, 
first  made  by  Mr  Udny  Yule,  obtaining  the  more  advantageous  formula 

_ 
~ 

known  as  the  Bravais-Pearson  Product-Moment  Formula.  All  we  have 
so  far  done  is  independent  of  the  form  of  distribution  of  the  correlated 
variates. 

From  a  historical  point  of  view  however  this  is  not  quite  the  way  in 
which  the  subject  developed.  In  1846  A.  Bravais  published,  in  Vol.  ix 
of  the  Memoires  de  Vlnstitut  de  France,  an  article  entitled  "Analyse 
mathematique  sur  les  probabilites  des  erreurs  de  situation  d'un 

*  Karl  Pearson:  "On  the  Relationship  of  Intelligence  to  Size  and  Shape  of  Head,  and 
to  other  Physical  and  Mental  Characters,"  Biometrika,  1906—1907,  v.  pp.  105—146. 
See  below. 
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point,"  which  was  really  a  very  complete  study  of  normal  correlation, 
although  this  is  not  perhaps  obvious  to  anyone  casually  glancing  through 
the  volume.  (He  uses  the  word  "correlation"  itself,  however,  on  p.  263.) 

Galton's  work  was  done  in  ignorance  of  this  article,  and  of  course 
was  work  applied  directly  to  certain  social  and  anthropological  problems, 
whereas  Bravais'  is  a  piece  of  mathematics  only.  Also  in  ignorance  of 
Bravais'  work,  Professor  F.  Y.  Edgeworth*  took  some  important  steps 
on  the  road  later  followed  by  Professor  Karl  Pearson,  who  connected 
Galton's  work  with  Bravais',  adopted  the  product- moment  formula 
which  was  implicit  in  the  latter's  equations,  and  showed  that  it  is  the 
best  formula  for  the  purpose,  i.e.  it  has  the  least  probable  error. 

Mr  Udny  Yule  still  later  pointed  out  the  simple  method  of  arriving 
at  this  formula  which  we  have  used  in  an  article  of  the  Proceedings  of 
the  Royal  Society  for  1896 — 7,  the  chief  importance  of  which  is  that  it 
points  out  the  fact,  then  first  adequately  realised,  that  the  product- 
moment  formula  has  a  definite  significance  even  if  the  distribution  of 
errors  is  not  normal. 

In  the  present  section  we  shall  now  consider  more  definitely  normal 
correlation. 

If  the  x  variate  is  normally  distributed  according  to  the  law 

1  -£ 

and  the  y  variate  according  to  the  law 

-e~^  (2), 


then  the  probability  of  simultaneous  occurrence  of  a  value  x  and  a  value  y  is 

_  i  (±  +  yl\ 

P  =  9-    -e    2^    'Vdxdy  ......  (3), 

ZTTOjOg 

provided  x  and  y  are  independent  or  uncorrelated.   If  however  they  are 
correlated,  this  probability  is 


which  reduces  to   the  former  expression  when  r,  the  coefficient  of 
correlation,  becomes  zero.   The  surface 

z^P'jdxdy  ......  (5) 

*  Phil.  Mag.  1892  and  1893,  several  articles.  In  a  paragraph  buried  in  one  of  these, 
indeed,  Professor  Edgeworth  reached,  but  did  not  realise  the  importance  of,  the  product- 
moment  formula,  which  he  there  gives  as  the  best  formula  (Phil.  Mag.  July  1893,  Series  5, 
xxxvi.  on  p.  100). 
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is  the  normal  correlation  surface,  and  is  a  hillock  shaped  like  a  bell 
with  an  oval  mouth. 

When  two  variates  are  recorded  on  a  grid-iron  table  like  that  used 
in  the  above  example  of  Speed  of  Addition  and  Accuracy  of  Addition, 
the  resulting  table  is  called  a  "correlation  table*."  As,  owing  to  the 
experimental  difficulties  of  the  subject,  psychological  correlation  tables 
are  seldom  smooth  enough  to  illustrate  vividly  the  points  about  to  be 
mentioned,  we  quote  here  a  correlation  table  from  the  study  of  heredity 
which  has  already  been  used  elsewhere  for  a  similar  purposef. 

Father's  Stature 


Inches 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

60 

*'  ' 

2 

2 

4 

61 

2 

4 

62 

1 

1 

2 

4 

1 

1 

2 

2 

63 

. 

1 

i 

9 

9 

8 

16 

20 

11 

5 

. 

1 

1 

64 

4 

. 

6 

15 

12 

17 

32 

37 

12 

5 

6 

3 

5 

65 

8 

4 

2 

8 

13 

38 

54 

43 

30 

22 

14 

10 

66 

. 

2 

4 

9 

21 

38 

40 

67 

70 

64 

21 

8 

10 

4 

67 

6 

8 

19 

14 

55 

79 

106 

103 

78 

50 

55 

13 

2 

4 

68 

6 

8 

30 

40 

41 

97 

126 

94 

118 

53 

34 

38 

9 

69 

4 

. 

21 

20 

51 

73 

64 

96 

116 

86 

40 

14 

9 

4 

70 

4 

10 

23 

75 

47 

78 

90 

78 

58 

25 

14 

6 

4 

71 

13 

20 

35 

43 

76 

59 

83 

43 

32 

20 

4 

4 

72 

1 

12 

5 

28 

31 

43 

45 

40 

34 

11 

2 

73 

3 

3 

10 

30 

26 

24 

30 

25 

13 

2 

2 

74 

4 

6 

6 

21 

9 

10 

26 

13 

13 

8 

75 

4 

8 

10 

3 

7 

2 

76 

5 

1 

2 

4 

4 

77 

5 

1 

4 

6 

78 

4 

4 

1 

3 

79 

1 

1 

For  simplicity  in  printing,  the  numbers  in  the  original  table  in  Biometrika,  1902 — 3, 
n.  p.  415,  have  been  multiplied  by  four:  this  eliminates  quarters  and  halves  which  occur 
through  some  heights  being  half-way  between  whole  inches. 

*  It  is  very  unfortunate  that  Mr  J.  C.  Maxwell  Garnett  has  recently  used  this  term 
(already  fixed  in  the  meaning  given  in  the  text)  for  something  quite  different.  Proc.  Roy. 
Soc.  1920,  xcvi.  A,  p.  100. 

f  E.g.  Mr  Udny  Yule's  text-book  on  the  Theory  of  Statistics;  and,  independently, 
by  G.  H.  Thomson,  "Mathematics  and  the  Inductive  Methods  of  Logic,"  Proc.  Univ. 
Durham  Phil.  Soc.  1912—13,  v.  pp.  76—99. 
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It  is  interesting  to  look  at  a  correlation  table  in  greater  detail.  If 
we  think  of  it  as  a  plane  horizontal  surface,  and  erect  over  the  centre 
of  each  compartment  a  vertical  line  proportional  to  the  number  written 
in  that  compartment,  then  the  tops  of  these  lines  touch  the  correlation 
surface. 

It  is  clear  from  the  equation  to  the  surface  that  the  contours,  or 
lines  of  equal  2,  are  ellipses,  given  by  the  equation 


2rxy      y2 
-^- 


+  -^-a  =  constant 


(6). 


In  the  figure  the  numbers  40  and  over  have  been  printed  in  italics  so 
that  this  contour  line  can  be  approximately  followed.  It  is  seen  to  be 
roughly  elliptical,  the  major  axis  of  the  ellipse  lying  obliquely.  The 
major  axes  of  all  the  contour  ellipses  of  a  surface  showing  correlation 
are  inclined  to  the  axes  of  coordinates;  and  if,  as  we  always  can,  we 
choose  the  linear  units  of  x  and  y  on  the  diagram  in  such  a  ratio  that 


Fig.  20 


Fig.  21 


Q!  =  cr2,  then  this  inclination  will  be  at  45°,  as  shown  in  the  accompanying 
figure  (Fig.  20). 

In  the  case  of  no  correlation,  r  =  0,  and  if  o^  =  cr2  the  equation  (6) 
given  above  for  the  contour  lines  represents  a  circle.  In  this  case, 
therefore,  with  suitable  units  of  x  and  y,  the  contours  are  circles,  and  the 
correlation  surface  is  a  perfectly  symmetrical  hillock.  As  correlation 
increases,  the  contour  lines  become  more  and  more  drawn  out  along 
the  45°  line  (or  the  135°  line  for  negative  correlation),  as  suggested 
in  the  figure  (Fig.  21). 

In  a  contoured  plan  of  a  correlation  surface,  those  lines  through  the 
origin  are  important  which  cross  all  contour  lines  at  points  where  the 
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tangents  to  the  contour  lines  are  parallel  to  the  axes  of  coordinates. 
Such  are  L^L^  and  L2L2  in  Fig.  20.  If  we  differentiate  equation  (6) 
with  regard  to  x  and  equate  to  zero  we  shall  obtain  L2L2.  We  find 

2x      2ry  _ 


or  simplifying 
Similarly  the  equation  of 


°i        ^2 
is 

1=  ,£ 


That  is,  these  are  the  regression  lines. 

We  have  said  that  any  horizontal  section  of  a  normal  correlation 
surface  is  an  ellipse.  Any  vertical  section,  on  the  other  hand,  is  a  normal 
probability  curve.  Consider  first  a  vertical  section  parallel  to  the  z-axis. 
Write  y  =  c  in  equation  (4). 

After  a  little  simple  algebraical  arrangement  this  then  reduces  to  the 
form  a  , 

")    -  («  -»  ?.)'/ 


This  is  a  normal  curve,  with  area 


Its  centre  is  at  a;  =  rco^/crg,  y  =  c,  i.e.  on  the  regression  line.  Its 
standard  deviation  is  c^  \/(l  —  r2)  and  is  independent  of  c. 

Similar  statements  hold  for  a  vertical  section  along  a  line  x '—  c', 
the  constant  standard  deviation  being  a2\/(l  ~  r2)- 

A  similar  procedure  will  show,  with  rather  more  cumbrous  algebra, 
that  any  vertical  section  is  a  probability  curve. 

(5)  OTHER  METHODS  OF  DETERMINING  CORRELATION 
1.   FOURFOLD  TABLE. 


*i 

*2 

2/i 

a 

b 

a  +  b 

y* 

c 

d 

c  +  d 

a  +  c 

b+d 

N 
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(a)  When  the  divisions  pass  through  the  means  of  both  characters, 

.    TT  (a  —  b) 

r  =  sm  ^  ~ fi  . 

2  (a  +  b) 

This  formula  (Sheppard's)  is  of  little  practical  use,  since  the  mean 
values,  in  cases  where  the  fourfold  table  is  the  only  method  which  can 
be  used,  are  generally  unknown. 

(b)  In  cases  where  the  dividing  lines  do  not  pass  through  the  means 
(means  not  known)  the  equations  for  determining  the  correlation  are : 


a 


Fig.  22 


and 


d)         /§|»     *-, 

.)  "  V  7rJ0 


N  (the  total  no 

(the  cross-hatched  area  on  the  upper  curve) 


(the  cross-hatched  area  on  the  side  curve) 

from  which  h  and  k  can  be  obtained  by  Sheppard's  Tables  ;  and  finally 
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~h 


3)  (k*  ~ 


3) 


hh*~  10*2 


1  5040 
+  etc., 


-  15)  (AJ8  - 


-  15) 


where 


In  obtaining  these  equations,  a  normal  correlation  surface  is  assumed, 
with  the  equation  : 

2rxy 


2?r(r1o>2  Vl  —  f2 

The  probable  error  of  r  obtained  by  the  fourfold  table  method  is 
much  larger  than  that  given  by  the  formula  of  p.  113.  The  correct 
formula  is  too  complicated  to  insert  heref. 

2.   METHOD  OF  CONTINGENCY:]:. 

The  following  is  an  example  of  a  contingency  table  §. 

Fathers 


Merry 

Melancholy 

Alternating 

Even 

Totals 

Merry 
Melancholy 
Alternating 
Even 

122 
10 
70 
58 

8 
2 
9 
6 

81 
7 
101 
66 

67 
10 
68 
45 

278 
29 
248 
175 

Totals      ... 

260 

25 

255 

190 

730 

*  All  this  is  rendered  much  easier  by  P.  F.  Everitt's  "  Tables  of  Tetrachoric  Functions," 
Biometrika,  vn.  pp.  437 — 451,  or  Table  XXIX  in  Pearson's  Tables. 

t  Unless  the  dichotomies  are  extreme,  good  values  are  obtained  from  the  use  of 
Tables  XXIII  and  XXIV,  Pearson's  Tables. 

J  Karl  Pearson,  "  On  the  Theory  of  Contingency  and  its  Relation  to  Association  and 
Normal  Correlation,"  Drapers'  Company  Research  Memoirs,  Biometric  Series,  i.  1904; 
Dulau  and  Co.,  London. 

§  Taken  from  a  paper  by  E.  Schuster  and  E.  M.  Elderton  on  "The  Inheritance  of 
Psychical  Characters  (being  a  further  Statistical  Treatment  of  Material  Collected  and 
Analysed  by  Messrs  G.  Heymans  and  E.  Wiersma),"  Biometrika,  1906—1907,  v.  pp. 
460—469. 
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The  method  is  employed  when  the  grouping  is  merely  by  class  and 
the  different  classes  have  no  known  relation  to  one  another — in  other 
words,  when  the  grouping  is  merely  qualitative.  The  order  of  the 
different  qualities  can  be  changed  without  making  any  difference  to 
this  method. 

The  relationship  between  the  two  variables  is  measured  by  the 
differences  between  the  numbers  actually  found  in  the  various  com- 
partments of  the  table,  and  the  numbers  that  might  be  expected  there 
by  pure  chance. 

To  state  the  rule : 

The  total  mean  square  contingency,  <£2,  of  the  table  is  given  by 


1,  (("»-¥ 


f-yS, 

tf 

where  nv  =  total  frequency  in  pth  row, 

nff   =  total  frequency  in  qih  column, 

n^  =  frequency  of  constituent  common  to  pth  row  and  ^th  column, 
N   =  total  number  of  cases  in  the  table. 
Then  the  coefficient  of  mean  square  contingency  C1  is: 

<?i 

If  it  is  assumed  that  a  normal  distribution  underlies  the  classification, 
and  if  the  fineness  of  grouping  is  right,  then  the  coefficient  C1  is  nu- 
merically equal  to  the  correlation  coefficient  r. 

In  the  above  case,  Ct  =  0-16. 

The  probable  error  of  Cl  is  very  complicated*)-. 

For  (?!  =  0,  P.E.  =  -67449/vW. 

Instead  of  calculating  the  mean  square  contingency,  it  is  easier  though 
not  so  accurate  to  calculate  the  mean  contingency.  Each  quantity 


N 

*  There  are  certain  corrections  to  #a,  not  mentioned  here,  which  often  make  con- 
siderable difference  to  the  result.  See  K.  Pearson,  F.R.S.,  "On  the  Influence  of  Broad 
Categories  on  Correlation,"  Biometrika,  1913,  ix.  pp.  116 — 139.  The  important  point  is  to 
have  fine  enough  grouping,  but  not  so  fine  as  to  leave  cells  with  very  few  or  no  cases 
in  them. 

f  J.  Blakeman  and  Karl  Pearson,  "On  the  Probable  Error  of  Mean  Square  Contin- 
gency," Biometrika,  1906,  v.  pp.  191 — 197.  A.  W.  Young  and  Karl  Pearson,  "On  the 
Probable  Error  of  a  Coefficient  of  Contingency  without  Approximations,"  Biometrika, 
1916,  xi.  pp.  215—230. 
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is  called  a  subcontingency,  and  it  will  be  observed  that  in  the  formula 
for  <f>2  these  were  squared.  Instead  of  this,  let  us,  without  squaring 
them,  add  the  positive  subcontingencies  only  (for  of  course  the  sum  of 
the  whole  is  zero),  and  write 


From  $,  by  using  Table  XXXIV  of  Pearson's  Tables,  a  value  of 
the  second  coefficient  of  contingency  C2  is  read  off,  which,  under  condi- 
tions similar  to  those  outlined  above,  also  is  equivalent  to  r. 

The  contingency  method,  it  must  again  be  emphasised,  gives  these 
two  measures  of  the  connection  or  association  of  the  qualities  considered 
even  without  any  assumption  that  a  continuous  variation  underlies 
the  discrete  classification.  If  however  such  is  assumed,  then  the  approach 
to  equality  of  C±  and  Cz  will  be  a  good  measure  of  the  normality  of  the 
distribution  and  the  suitability  as  to  smallness  of  our  elements  of 
grouping.  With  very  fine  grouping  we  get  into  difficulties  owing  to  having 
to  record  by  units  only.  16  to  25  subgroups  is  a  good  range. 

It  is  interesting  to  find  that  areas  of  positive  contingency  are  sepa- 
rated from  areas  of  negative  contingency  on  a  normal  surface  by  a 
hyperbola  having  a  simple  relationship  with  the  contour  ellipses. 

3.  TWO-BOW  TABLE*. 

This  method  gives  a  unique  value  of  r  in  the  case  of  two  variates 
one  of  which  is  both  quantitative  and  continuous  (e.g.  intelligence), 
while  the  other,  though  quantitative,  admits  of  only  two  subdivisions 
(e.g.  into  good  and  bad  visualisers),  or,  in  more  technical  language,  is 
"alternative." 

-*•  x  (intelligence) 


Good 

visualisers 

Bad 

visualisers 

N 

*  Karl  Pearson,  F.R.S.,  Biometrika,  1909,  vn.  p.  97. 
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The  assumptions  made  are  two  in  number: 

(1)  that  the  regression  is  linear: 

(2)  that  the  distribution  of  the  alternative  variate  is  approxi- 
mately normal  or  Gaussian. 

The  accompanying  diagram  (Fig.  23)  will  make  the  method  clearer. 
LL'  is  the  regression  line  of  x  on  y,  its  equation  being 

x  -  x  =  r  ^  (y  -  y), 


X  —  X 

.*.   r  = 

<TI 

y-y' 

—  »* 

** 

R 

* 

n     A       y 

L  \ 

\ 

&k 

Good 

\ 

\ 

\ 
\ 
\ 

I         i 

%  s 

Bad 

—  ^ 

visualisers 

\ 
\ 

\ 

\ 
\ 

\ 

/ 

/'' 

Fig.  23 

Putting  x,  the  mean  of  all  the  x'a,  =p,  x  =  p',  the  mean  value  of 
the  array  of  sc's  corresponding  to  the  smaller  group  of  y's  (good  visual- 
isers), and  y  —  y  =  q,  the  distance  of  the  centroid  of  the  area  J?$T  from 
the  mean  MM  of  the  Gaussian  curve,  we  have  the  formula  : 


The  numerator  of  this  is  known,  and  the  denominator 


f°°      ~2 

y°Le 


a2  x  area  of  part  jftS 
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where  nt  =  no.  of  cases  in  upper  group  (area  RST), 

and  N  =  total  no.  of  cases. 

In  this  expression  ^  =  |  (1  +  a)  in  Sheppard's  Tables*  from  which  the 
corresponding  value  of  —  can  be  read ;  and  the  value  of  z  corresponding 

°2 

to  this  latter  result  gives       1  / 

1  ,~"iW 


r  = 


p-p 

1 


the  value  of  z  in  Sneppard  s  Tables  correspondingx 
tothei(l-«)of|  ) 

The  above  method  might  advantageously  be  used  in  determining 
the  relationship  of  two  characters,  such  as  those  suggested  (intelligence 
and  visual  imagery),  which  are  both  quantitative,  but  of  which  one 
only  admits  of  reliable  division  into  two  groups.  It  is  the  smaller  of 
these  groups  (the  "tail"  of  the  distribution)  which  gives  the  n  of  the 
formula.  In  the  suggested  case,  it  might  be  represented  by  the  number 
of  individuals,  out  of  the  entire  group  N  measured,  who  successfully 
pass  certain  somewhat  difficult  tests  of  visual  imagery. 

In  cases  where  the  #-variate  (continuous  and  quantitative  in  our 
example  above)  can  only  be  divided  into  classes,  showing  no  definite 
order  or  quantitative  relations  to  one  another,  the  ^/-variate  being  again 
quantitative  and  assumed  to  follow  a  normal  distribution,  but  alter- 
native, a  modification  of  the  above  method  gives  77. 

4.   SHORT  METHODS|. 

(i)    It  can  easily  be  shown  that 


and  therefore  that         rxy  =  -^—  — —  . 

Z(7X(7y 

(ii)    If  the  distributions  are  both  normal,  and  if  both  variates  have 
the  same  meaning,  and  the  same  standard  deviation  (7,  then 


*  BiometriJca,  n.  p.  182;  Table  II  of  Pearson's  Tables. 

t  Karl  Pearson,  "  On  Further  Methods  of  Determining  Correlation,"  Drapers'  Company 
Research  Memoirs,  Biometric  Series,  iv.  1907. 
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where  S  is  the  sum  of  the  positive  differences  only.  This  method  might 
sometimes  be  conveniently  used  in  determining  the  "individual"  corre- 
lation between  performances  of  the  same  individuals  in  the  same  mental 
tests  on  different  occasions. 

5.   THE  METHOD  OF  RANKS. 

Some  years  ago  the  valuable  suggestion  was  made  by  Professor 
C.  Spearman*  that  measurements  of  psychical  performance  may  con- 
veniently —  nay,  preferably  —  be  replaced  by  the  numbers  representing  the 
rank  or  order  of  merit  of  the  individuals  in  the  group.  On  this  basis 

the  ordinary  product-moment  formula  for  r,  A7/  ^'  ,  can  be  easily  shown 

lYCTjCJo 

to  reduce  to  the  form  gg  /     _  y  >2 


N  (N*  -  1)  ......     ' 

where  vl  and  v2  are  the  ranks  of  an  individual  in  the  two  series. 

Professor  Spearman  also  suggested  a  still  simpler  formula,  which  he 
calls  a  "foot-rule"  formula.  It  is 


P        1 

- 

where  S  (g)  denotes  the  sum  of  the  "gains"  in  rank  (sum  of  positive 
differences)  of  the  second  series  on  the  first,  and  then  empirically,  by 
noting  the  distribution  of  a  large  number  of  chance  values  of  S  (g), 


-  sing  R) 


(r). 


This  method  of  using  ranks,  and  the  formulae  suggested  therefor, 
were  vigorously  criticised  by  Karl  Pearson  in  the  paper  quoted  on  the 
preceding  pagef. 

Some  form  of  frequency  distribution  must  be  assumed,  and  the 
"foot-rule"  method  assumes  that  form  to  be  a  rectangle.  On  the  assump- 
tion of  normal  distribution,  Professor  Pearson  shows  that 


......  (S), 

where  p  has  the  value  given  above. 

*  C.  Spearman:  "Measurement  of  Association  between  Two  Things,"  Am.  Journ. 
Psychol.  1904.  xv;  "'Foot-rule'  for  Measuring  Correlation,"  Brit.  Journ.  of  Psychology,  n. 
Pt.  I,  July  1906.  Were  it  possible  to  keep  R  in  its  place  merely  as  a  "foot  -rule"  whose 
"chief  mission  is  to  gain  quickly  an  approximate  valuation  of  r,"  it  would  not  be  harmful. 
But  the  ease  of  its  calculation  leads  to  its  use  too  frequently  "not  merely  for  assay  purposes 
as  originally  contemplated,  but  even  sometimes  for  research"  (see  C.  Spearman,  Brit. 
Journ.  of  Psychol.  1910,  m.  p.  286). 

f  Professor  Spearman  endeavours  to  meet  some  of  these  criticisms  in  Brit.  Journ. 
of  Psychol.  1910,  in.  p.  271  sqq. 
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In  terms  of  the  sum  of  positive  differences  of  ranks  ("gains"  in  rank) 
the  formula  is 


=  2  cos|(l  -  R)  -  I  (€). 

This  agrees  very  closely  with  Spearman's  formula  (y),  and,  having 
a  definite  theoretical  basis,  should  now  take  its  place. 

Professor  Pearson  has  proved  that  the  probable  error  of  p  for  zero 

•67449 

correlation  =  ~^=  .    He  is  thus  able  to  show  that,  for  zero  correlation. 
Vn— 1 

,  ,  ,  -7063 

P.E.  of  r  deduced  from  p  =  , 

V  n  —  1 

•7738 
and  P.E.  „  r        „          „     R  =     

Vn-l 

While  this  book  is  going  through  the  press,  Professor  Spearman  has 
very  kindly  sent  on  to  us  the  proof  sheets  of  portions  of  a  work  on 
correlation  by  Professor  Wirth  of  Leipzig,  in  which  considerable  space 
is  devoted  to  the  method  of  ranks,  of  which  the  author  highly  approves 
and  which  he  ingeniously  develops  in  several  ways.  Although  we  cannot 
completely  share  this  approval,  we  can  appreciate  the  thorough  exposi- 
tion given  and  especially  the  clear  distinction  drawn  between  (1)  the 
stage  of  complete  measurement,  (2)  the  stage  of  ranks  in  which  classes 
are  arranged  in  order,  and  (3)  the  stage  of  contingency,  in  which  classes 
of  x  can  be  assigned  to  classes  of  y,  without  however  being  in  any  order 
of  magnitude. 

This  shows  clearly  the  position  which  rank  methods  take  up.  The 
controversy  concerning  these  methods  really  falls  into  two  parts, 
(1)  questions  of  principle  and  (2)  questions  of  mathematical  detail, 
often  of  primary  importance.  Leaving  the  latter  aside  however,  let  us 
consider  Professor  Spearman's  claim  that  ranking  is  superior  to  measure- 
ment in  psychology  because  the  things  measured  are  not "  homogeneous," 
for  example  the  skin's  "spatial  sense"  and  its  sensitivity  to  pain*.  In 
short,  Professor  Spearman  raises  the  whole  question  of  mental  abilities 
being  "magnitudes"  or  true  "quantities"  (see  Chap.  I,  p.  11). 

We  would  reply  that  there  is  much  truth  in  what  he  says.    But 
(1)  for  practical  purposes  the  correlations  of  the  physical  signs  of  mental 
states  are  themselves  often  important,  whether  or  no  they  are  the  true 
*  Brit.  Journ.  of  Psychol  1906,  n.  p.  91. 
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correlations  of  the  mental  states  as  such,  and  (2)  the  physical  measure- 
ment often,  at  least,  goes  further  than  giving  the  mere  order  of  merit 
of  the  mental  states.  It  frequently  ranks  the  differences  also  (cf.  p.  12). 
Indeed,  in  many  cases,  such  as  the  giving  of  examination  marks,  there 
is  a  definite  attempt  to  do  more  than  say  that  this  is  the  order  of  merit 
of  the  competitors  —  there  is  an  attempt  to  say  something  like:  —  "This 
candidate  is  easily  first,  the  second  and  third  are  close  together,  the 
fourth  a  long  way  behind,"  and  so  on.  While  we  are  prepared  to  agree 
that  actual  marks,  in  an  examination  or  a  test,  are  hardly  likely  to  be 
true  measures  of  the  desired  mental  state,  yet  we  claim  that  they  show 
often  a  good  deal  more  than  mere  ranks.  Mental  measurement  need 
not  remain  at  the  second  of  Professor  Wirth's  levels,  even  though  it 
may  be  long  before  it  can  reach  the  first. 

(6)  CORRELATION  OF  SUMS  OR  DIFFERENCES* 
This  article  is  concerned  with  the  following  problem.  After  calcu- 
lating the  correlations  between  several  series  of  values,  it  frequently 
happens  that  we  want  the  correlations  given  by  some  of  the  series  added 
together;  and  differences  are  not  less  important  than  sums.  The  corre- 
lation of  the  pool  is  not  the  mean  of  the  correlations.  The  general  problem 
is  as  follows. 

Let  the  two  series  of  values  be  denoted  by  al9  a2  ...  av,  and 
blt  b2  ...  bg,  each  being  measured  from  its  own  mean  and  consisting  of 
N  cases.  Let  these  variates  be  multiplied  by  constants  or  weights. 
Required  the  correlation  between 

A  =  %%  +  n2a2  +  ...  -f  n9aV9 


and  B  =  m^  +  m2b2  +  ...  +     gQ. 

Since  all  the  a's  and  6's  are  measured  from  their  means,  it  is  clear  that 
A  and  B  are  also  so  measured.   The  required  correlation  is  therefore 

S'  (AB) 


_ 

m 


where  the  symbol  S'  indicates  summation  from  1  to  2V.  We  shall  retain 

the  symbol  S,  on  the  other  hand,  for  summation  from  1  to  p  or  1  to  q. 

Consider  now  the  correlation  of  any  particular  a  with  any  particular  b. 

It  is  given  by 


That  is,  Najjfa,  =  8'  (db). 

*  C.  Spearman,  Brit.  Journ.  of  Psychol.  1913,  v.  p.  417. 
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Multiplying  the  a  by  its  constant  n,  and  the  b  by  its  constant  m,  only 
alters  the  standard  deviations  of  these  quantities,  not  their  means, 
since  they  are  already  measured  from  means.  We  have  therefore 

Nnmoracrbrab  =  S'  (na.mb), 

and  summing  this  over  the  p  and  q  measurements  of  a  and  b  we  get 
NS  (nmv^ra)  =  SSf  (na.mb). 

Now  the  right-hand  side  of  this  equation  means  the  sum  of  all  pos- 
sible products  of  na  and  nib.  But  a  little  consideration  will  show  that 
this  is  exactly  what  the  numerator  of  r,  viz.  S'  (AB),  means.  Therefore 

Sf  (AB)  =  NS  (wmovv-J. 

The  two  quantities  in  the  denominator  of  r  are  found  similarly,  or  by 
putting  A  =  B  in  the  expression  just  arrived  at,  and  we  have  finally 


(nnaaaaraa)  S  (mmaborbrbb)}  ' 

wherein  it  must  be  noted  carefully  that  the  summations  in  the  de- 
nominator include  the  correlations  of  an  a  or  of  a  b  with  itself,  corre- 
lations which  of  course  are  unity.  Excluding  these  cases  from  the 
summations,  and  recognising  that  each  case  like  asat  has  a  twin  case  atas, 
we  arrive  at  the  following  form  of  the  same  formula  : 

__  S  (nmagcr^r^}  _ 
35  V(S  (»V)  +  2S  (nno-aDaf*,)}  V(S  (mV)  +  2S  (mincer*)}  * 
In  the  form  in  which  this  formula  will  perhaps  most  frequently  be 
used,  p  will  equal  q,  and  both  will  equal  2,  while  the  weights  n  and  m 
will  all  be  unity.  We  then  have 

_ 

~  A/ 

The  immediate  practical  uses  of  the  general  formula  are  perhaps  not 
so  important  as  the  theoretical  uses  to  which  Professor  Spearman  has 
applied  it. 


CHAPTER  VII 

THE  INFLUENCE  OF  SELECTION 

Influence  of  mild  selection  on  a  and  r — Rigorous  selection  and  partial  correlation — 
Three  correlated  variables  represented  by  dice  throws — Multiple  correlation — 
Spurious  correlation — Variate  difference  correlation  method. 

(1)  THE  INFLUENCE  OF  MILD  SELECTION 

THE  essential  point  about  the  whole  theory  of  correlation  is  that  it  tells 
us  how  a  group  of  individuals  selected  from  the  general  population 
according  to  some  characteristic  (say  as  being  within  certain  limits  of 
height,  or  possessing  some  mental  ability  or  manual  dexterity  in  a  high 
degree)  will  also  differ  from  the  general  population  in  other  charac- 
teristics. 

The  ordinary  correlation  coefficient  already  tells  us  much  in  this 
respect.  For  example,  if  the  correlation  between  two  abilities,  say 
(1)  the  ability,  whatever  it  may  be,  which  is  measured  by  Dr  McDougalPs 
Dotting  Machine  and  (2)  the  ability  to  memorise  Nonsense  Syllables 
according  to  certain  experimental  regulations,  be  known  to  be  -4  for 
the  whole  population,  this  means  that  if  a  group  be  selected  with 
" Dotting"  ability  equal  to  x  (measured  from  the  general  mean  in 
a  units)  then  this  group  will  most  probably  have  an  average  "  Nonsense 
Syllable"  ability  equal  to  -4x  (measured  in  a  similar  way). 

Clearly  in  practice  we  do  not  usually  know  the  means  and  the 
correlation  for  the  whole  population  but  only  for  samples.  We  take 
large  samples  and  endeavour  to  ensure  that  they  are  random  and  not 
selected  samples  from  the  population  we  wish  to  investigate. 

The  selection  contemplated  in  the  above  example  is  very  rigorous: 
all  the  individuals  are  presumed  alike  in  regard  to  "Dotting"  ability. 
In  practice  such  a  rigorous  selection  never  takes  place.  The  boys  in 
a  school  form,  for  instance,  are  more  alike  in  say  ability  in  Latin  than 
the  general  population,  yet  not  absolutely  alike.  The  "scatter"  of  this 
variate  (Latin)  has  been  reduced,  yet  not  to  zero. 

Just  as  selecting  a  group  of  individuals  for  one  variate  will  alter 
the  average  value  of  other  variates,  so  it  will  alter  the  scatter  of  these 
other  variates,  and  their  intercorrelations.  It  is  this  phenomenon  which 
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in  an  extreme  form  gives  us  what  we  already  know  as  "partial  corre- 
lation" (seep.  105). 

In  fact,  selecting  a  group  of  individuals  within  certain  limits  of  a 
quality  A  implies  an  indirect  and  less  rigorous  but  frequently  very 
important  selection  of  the  other  qualities  B,  C,  ...  of  these  individuals 
and  of  their  intercorrelations.  Consider  the  simplest  case  of  three  organs, 
A  being  directly,  B  and  C  only  indirectly  selected.  Let  subscripts  1,  2 
and  3  refer  to  A,  B  and  C  respectively,  and  let  the  standard  deviations 
and  correlations  in  the  general  population  be  al5  a2,  <r3,  r12,  r^  and  r31. 
In  the  selected  group  o^  is  reduced  by  the  selection  to  sl9  and  cr2  and  <r3 
are  indirectly  altered  to  S2  and  S3,  r12,  r^,  r31  to  r12,  r^,  I31.  Then  the 
following  formulae  enable  these  quantities  to  be  calculated*. 

Write  si/ai  —  cos  Xi- 

Then  22/(72  =  sm  aiz>   an(*  23/a3  =  sin  ais> 

where  cos  a12  =  r12  sin  xi>   and  cos  a13  =  r13  sin  Xi- 

Further  T12  =  cot  Xi  cot  a12, 

T13  =  cot  xi  cot  a13, 

^      -  COS  g 


sin  a12  sin  a13 

For  example  let  us  suppose  for  the  moment  that  the  correlations 
between  (1)  Classics,  (2)  Drawing,  and  (3)  English  have,  in  the  general 
population  of  English  Fourth  Form  boys,  the  values  found  on  pp.  104 
and  105,  viz. 

^=•42, 


and  let  us  further  suppose  that,  say  on  a  standardised  percentage  system 
of  marking,  the  standard  deviations  of  the  marks  in  these  subjects  are 

*i  =  16, 
<r2  =  13, 
a3=14. 

Now  suppose  a  mild  selection  of  Fourth  Form  boys  to  be  made  on 
the  basis  of  their  ability  in  Classics,  and  in  the  selected  group  let  us 
suppose  that  the  standard  deviation  in  marks  in  Classics  is  reduced  to 

s1  =  12. 

*  See  Karl  Pearson,  F.E.S.,  "On  the  Influence  of  Natural  Selection  on  the  Variability 
and  Correlation  of  Organs,"  Phil.  Trans.  1902,  cc.  A,  pp.  1  —  66,  where  an  interpretation 
in  terms  of  spherical  trigonometry  is  given. 
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We  then  have  cos  xi  =  SI/<TI  =  12/16  =  0-75, 

x'  =  41°  25', 
sin  x   =  0*66, 
cotx'=  1-14. 

Further          cos  a12  =  r12  sin  xi  =  '42  x  -66  =  -28, 

a12  =  73°  44',  sin  a12  =  -96, 

cot  alz  =  -29 ; 

cos  a13  =  r13  sin  xi  =  "78  x  -66  =  -51, 

a13  =  59°  0',  sin  a13  =  -86, 

cot  a13  =  -60. 

S2  =  <72  sin  a12  =  13  x  -96  =  12-5, 
23  =  (73  sin  aw  =  14  x  -86  =  12; 
I12  =  cot  xi  cot  a12  =  1-14  x  -29  =  -33, 
TXS  =  cot  xi  cot  a13  =  1*14  x  -60  =  -68, 

3~  cosa12cosa13  _  -21  —  '28  x  -51 
sin  a12  sin  a^  -96  x  -86 


T23 


=  •12. 


Putting  these  results  together  for  comparison  we  obtain  the  following 
tables : 

Standard  Deviations 


Mild  selection    Rigorous  selection 
Before  selection        in  Classics  in  Classics* 


Classics 
Drawing 

English 


16 
13 
14 


12 

12-6 
12 


0 
11-8 

8-7 


Classics  and  Drawing 
Classics  and  English 
Drawing  and  English 


Correlations 

Mild  selection     Rigorous  selection 
Before  selection       in  Classics  in  Classics* 


•42 

•78 
•21 


•33 


•12 


-•21 


This  mild  selection  for  Classics  has  therefore  left  the  scatter  of  ability 
in  Drawing  almost  untouched,  but  has  made  the  group  somewhat  more 
homogeneous  than  it  was  in  English.  The  intercorrelations  are  all 
slightly  reduced,  that  between  Drawing  and  English  being  now  almost 
nil. 

*  See  later. 
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(2)  RIGOROUS  SELECTION  AND  PARTIAL  CORRELATION 

If  we  suppose  the  selection  in  Classics  to  be  absolutely  rigorous,  so 
that  the  resulting  group  is  absolutely  homogeneous  in  ability  in  that 
subject,  then  our  formulae  simplify  considerably  and  we  are  left  with 


cos  a12  =  r12, 
cosa13  =  r13; 


I12  is  meaningless,  the  variate  1  being  fixed,  and  similarly  r13; 

T      ~  rr 

2 


This  formula  was  reached  by  Mr  Udny  Yule*  before  the  more  general 
formulae  were  known,  and  is  called  the  "partial"  correlation  of  variates 
2  and  3  for  a  constant  value  of  1,  and  is  written  r23>1.  Mr  Yule  obtained 
its  value  by  applying  to  three  variables  the  methods  we  have  already, 
following  him,  employed  on  p.  108  for  two,  using  the  Method  of  Least 
Squares.  Let  the  regression  equation  of  y  on  z  and  x  be 

y  =  &23  Z  +  1>2iX. 

This  gives  the  normal  equations  for  623  and  621 


whence,  after  a  little  simplification, 

r        _  r23  ~ 


2 

~  T31        °3 


and  a  similar  expression  for  621.   From  the  regression  of  z  on  y  and  x 
we  get  in  like  fashion 


and  rgg  x  =  either  623  or  632  when  cr2  =  cr3  =  unity,  so  that 


*  G.  Udny  Yule,  "On  the  Significance  of  Bravais'  Formulae  for  Skew  Correlation," 
Proc.  Roy.  Soc.  1896,  LX.  pp.  477 — 489. 
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If  we  apply  these  formulae  to  our  three  variables  (1)  Classics, 
(2)  Drawing,  and  (3)  English,  we  shall  get  the  standard  deviations  and 
correlations  for  a  rigorous  selection  in  Classics,  given  in  the  third  column 
of  the  above  tables.  We  see  that  the  group  is  still  heterogeneous  in 
Drawing,  but  a  good  deal  more  homogeneous  in  English.  The  correla- 
tion between  Drawing  and  English  has  now  actually  been  reversed. 
Needless  to  say,  the  actual  numbers  in  this  example  are  not  to  be  taken 
as  giving  the  facts,  being  only  used  for  the  sake  of  illustrating  the  method. 

It  is  important  to  realise,  and  becomes  clear  from  the  above  con- 
siderations, that  all  correlations  are  partial  correlations,  inasmuch  as 
there  is  always  a  selection  of  the  group  we  are  working  with,  for  age, 
or  race,  or  social  standing,  or  what  not.  Indeed  even  the  whole  living 
population  is  only  a  group,  surviving  from  "what  might  have  been," 
by  natural  selection.  This  wide  point  of  view  will  save  us  from  many 
of  the  errors  into  which  we  are  apt  to  fall  in  handling  correlation 
coefficients. 

It  is  particularly  tempting  to  draw  what  are  usually  fallacious  con- 
clusions from  the  comparison  of  "entire"  and  "partial"  correlations 
as  to  the  underlying  factors  at  work  causing  the  correlations.  For 
instance,  in  the  present  case  one  might  be  tempted  to  conclude  that  the 
original  positive  correlation  between  English  and  Drawing  was  entirely 
due  to  factors  which  these  share  with  Classics,  that  is,  to  a  general 
factor,  and  that  any  direct  connection  of  these  two  subjects  is  of  an 
"interference"  nature.  But  such  conclusions  as  to  the  underlying 
mechanism  have  to  be  made,  if  at  all,  with  great  reserve,  as  will  be  seen 
from  examining  cases  where  we  have  independent  and  first-hand  know- 
ledge of  the  factors  at  work,  as  we  have  for  example  in  dice  throwing. 
A  correlation  can  be  set  up  between  two  dice  throws  of  m  and  n  dice 
respectively  by  leaving  some  of  the  m  dice  lying  to  form  part  of  the 
second  throw. 

(3)  THREE  CORRELATED  VARIABLES  REPRESENTED  BY 
DICE  THROWS* 

Let  n  red  dice,  n  blue,  n  yellow,  and  n  white  dice  be  thrown,  and  let 
the  variable  x  be  given  by  the  combined  red  and  white,  y  by  the  com- 
bined yellow  and  white,  and  z  by  the  combined  blue  and  white  scores, 
as  in  Fig.  24. 

That  is  to  say,  there  is  a  general  factor  (the  white  dice),  common 

*  This  section  is  an  extract  from  an  article  by  Godfrey  H.  Thomson,  Brit.  Journ. 
of  Psychol.  1919,  ix.  p.  323  et  seq. 
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to  all  three  variables,  which  causes  all  the  correlations  between  them. 
These  correlations  are 

TXV  ==  Tvz  =  Tzx  =  TjT. 

Red 

n 


ite 


7? 

77 

Blue 

Yellow 

Fig.  24 

Purple 

Grans 

77  Z.  

Fig.  25 
The  partial  correlations  are,  by  the  well-known  formula, 

'  rM7  .  9;  *  tjy.  •  tr.  '9;^* .  11   ~~~    \  ~&       "     <5" 


yz.x       ex.y 


(1  - 


It  is  not  permissible,  however,  to  reverse  this  statement,  and  to  assume 
that  in  every  case  where  rxy  =  ryz  =  rzx  =  J  the  correlations  are  formed 
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solely  by  the  action  of  a  general  factor.  In  fact,  identically  the  same 
values  can  be  produced  without  any  general  factor  at  all.  Let  n  purple, 
n  green,  and  n  orange  coloured  dice  be  thrown,  and  let  the  variable  x 
consist  of  the  purple  and  orange,  y  of  the  orange  and  green,  and  z  of  the 
green  and  purple  scores  combined,  as  in  Fig.  25. 

Here  there  is  no  general  factor  whatever.  The  connection  of  x  with  y 
(through  the  orange  dice)  is  entirely  independent  of  the  connection  of  x 
with  z  (through  the  purple  dice). 

Yet  the  correlations,  both  partial  and  entire,  are  exactly  the  same  as 
in  the  first  arrangement,  viz. 


Clearly,  therefore,  if  we  only  know  of  three  variables  x,  y,  and  z, 
formed  of  dice  throws,  that  their  correlations  are  as  above,  we  cannot 
say  with  certainty  whether  a  general  factor  exists  or  not.  Let  us  now 
consider  a  more  general  arrangement  of  dice,  with  numbers  of  different 
colours,  viz.  W  white,  R  red,  B  blue,  Y  yellow,  P  purple,  G  green, 
and  0  orange,  thus : 


x  consisting  of  the  scores  of  the  W,  R,  P,  0\  y  of  the  W,  Y,  0,  G,  and 
z  of  the  W,  B,  G,  P  dice. 

In  this  arrangement  we  shall  call  W  a  general  factor,  it  being  common 
to  all  three  variables ;  R,  Y,  and  B  specific  factors,  they  being  unique  to 
x,  y  and  z  respectively;  and  0,  G  and  P  group  factors,  since  each  runs 
through  a  group  of  (here  two)  variables. 
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The  theoretical  values  of  the  correlation  between  any  two  of  the 
variables,  say  x  and  y,  can  be  found  by  means  of  the  formula* 

Number  of  dice  common  to  x  and  y 
xv  ~  Geometrical  mean  of  total  dice  in  x  and  in  y  ' 
For  example,  if  x  is  the  score  of  9n  dice,  and  y  the  score  of  4n  dice, 
2n  being  common,  the  correlation  is 

r-        *!_  -|-  0-88. 
V  9n  x  ±n      o 

The  general  arrangement  of  dice  shown  in  the  above  figure  includes  the 
two  special  cases  already  considered,  viz. 

(1)   P  =  G=0  =  zero,    R  =  B=Y=W  =  »; 
and  (2)  P  =  G  =  0  =  n,        R  =  B  =  Y  =  W  =  zero; 

which  both  give  rxy  =  ryz  =  rzx  =  £|. 

Of  the  infinite  arrangements  possible  with  this  diagram,  an  infinite 
number  (of  a  lower  order)  can  in  general  be  constructed  to  produce  any 
given  set  of  positive  correlations  between  x,  y  and  z%.  Moreover,  all 
these  possible  ways  of  producing  the  required  correlations  are,  in  our 
ignorance,  equally  likely  to  have  been  those  used  by  the  person  making 
the  arrangement  of  dice,  although  they  are,  it  is  true,  not  equally 
probable  as  chance  occurrences.  From  the  correlations,  therefore,  we 
cannot  in  general  deduce  what  proportion  the  white  dice  (i.e.  the  par- 
ticular colour  representing  the  general  factor)  bears  to  the  others,  for 
this  proportion  can  vary  between  wide  limits,  and  give  exactly  the  same 
correlations.  The  most  we  could  conceivably  do  would  be  to  give  the 

*  This  formula  was  proved  by  me  iaBrit.  Journ.  ofPsychol.  1916,  vm.  p.  275,  in  ignorance 
of  any  former  statement  of  it.  Professor  Spearman  showed  (ibid.  p.  282)  that  it  is  deducible 
from  a  formula  of  his  concerning  correlation  of  sums  or  differences.  I  have  since  noticed 
that  Professor  Spearman  gives  the  following  clear  expression  of  the  formula  (Am.  Journ.  of 
Psychol.  1904,  xv.  p.  75)  :  "The  correlation  is  always  the  geometrical  mean  between  the  two 
shares."  I  do  not  however  agree  with  the  application  he  there  proceeds  to  make.  The 
formula  can  also  be  directly  deduced  from  Bravais,  and  in  several  other  ways.  It  assumes 
that  the  elements  all  have  the  same  standard  deviation,  as  dice  have.  G.H.T. 

f  If  he  be  so  minded,  the  reader  can  make  thousands  of  other  dice  patterns,  all  giving 
the  correlations  rxv~ryz~rzx  =  ^'  ^ne  grouP  is  given  by 

R  =  B=Y=W  =  kn,    P  =  0  =  0=sn. 

But  such  a  high  degree  of  symmetry  is  unnecessary.  For  example,  another  pattern  giving 
the  same  correlations  is 


R=9In,    Y=9ln,  £=54w,   P  =  18n,   G  =  78n,   0  =  91n,    W  =  78n. 

The  number  of  white  dice  present  ranges  between  the  two  extreme  cases  given  in  the  text. 
J  A  convenient  method  of  finding  such  arrangements  or  patterns  to  give  specified 
values  of  the  correlation  coefficients  is  explained  by  J.  Ridley  Thompson,  Brit.  Journ. 
of  Psychol  1919,  x.  p.  98. 
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"expectation"  of  the  proportion  of  white  dice.  The  meaning  of  this 
would  be,  that  if  a  very  large  number  of  arrangements  of  dice  were 
examined,  each  giving  the  required  set  of  correlation  coefficients,  and 
if  we  assume  that  these  arrangements  of  dice  are  not  formed  on  any  plan, 
beyond  that  they  all  agree  in  the  correlations  they  produce,  then  the 
average  proportion  of  white  dice  would  be  that  named  as  the  "expec- 
tation "  thereof.  But  if  there  is  any  reason  to  think  that  the  large  number 
of  cases  examined  are  all  of  much  the  same  pattern — as  there  would  be 
were  they  all  natural  phenomena  of  the  same  sort — then  the  "  expecta- 
tion" of  the  proportion  of  white  dice  becomes  useless  and  meaningless. 
We  cannot  conclude  anything  which  is  of  any  definite  value  in  con- 
structing the  pattern,  except  give  limits  within  which  it  must  lie. 

This  brings  us  to  the  problem : — Are  there  any  values  of  rxy,  ryz  and 
rzx  which  make  it  certain  (having  regard  to  their  probable  errors)  that 
at  any  rate  some  general  factor,  some  number  of  white  dice,  exists?  The 
answer  is  that  this  is  so  if  the  correlations  are  large  enough.  For  example, 
if  we  take  the  special  case  of  equality  of  the  three  coefficients, 

^xv  ~  Tyz  ==  rzx  =  T) 

then  up  to  r  =  J  the  correlations  can  be  imitated  by  various  numbers  of 
red,  blue,  yellow,  purple,  green  and  orange  dice  without  any  white  dice. 
But  as  soon  as  the  common  value  r  rises  above  J,  some  white  dice  are 
necessary.  In  this  case  therefore  the  proof  of  the  existence  of  (at  any 
rate)  some  amount  of  general  factor  reduces  to  the  examination  of  the 
probable  error  of  r,  to  see  if  r  is  indisputably  greater  than  J. 

In  the  more  general  case  the  matter  is  not  so  simple,  the  three  values 
of  r  differing  from  one  another.  The  more  detailed  examination  of  this 
case  is  reserved  for  treatment  elsewhere.  It  will  be  found  however  that 
if  the  quantity 

TXV*  +  fv?  +  rzx2  +  torfrfn 

is  indisputably  greater  than  unity,  then  some  white  dice,  some  general 
factor,  may  be  postulated  with  certainty*.  It  may  not  be  out  of  place 
to  remind  ourselves  again  that  this,  though  true  of  the  arrangements  of 
dice  we  are  considering,  may  not  be  true  in  the  same  sense  of  other 
phenomena,  e.g.  biological  or  mental  phenomena. 

The  following  two  examples  illustrate  the  above  principles. 

*  A  rough  guide  is  the  average,  value  of  the  three  r's:  if  this  is  greater  than  \  some 
general  factor  certainly  exists.  The  exact  condition  however  is  that  given  in  the  text. 
It  is  due,  in  this  form,  to  Mr  J.  R.  Thompson,  see  Brit.  Journ.  of  Psychol.  1919,  ix.  p.  335. 
Note  that  all  this  only  applies  to  correlations  produced  by  overlapping  dice  throws,  or  by 
some  sufficiently  similar  mechanism. 
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Example  A 

Three  variables,  composed  of  overlapping  dice  throws,  give  corre- 
lations as  follow: 

rxy  =  0-32,     rvz  =  0-33,    rzx  =  0-54. 
Are  any  dice  common  to  the  three  variables? 
In  this  case  we  find 

rxy2  +  ry*  +  rzx2  +  2rxvrvzrzx  =  0-56,   i.e.  <  1. 

From  this  we  conclude  that  these  correlations  can  be  imitated  either 
with  or  without  a  general  factor  of  white  dice.  The  following  arrange- 
ments of  dice  do  actually  produce  these  correlations. 

Case  (1).    R  =  19ft,  B  =  17ft,  Y  =  85ft,    (specific  factors), 
P  =  G  =  0  =  zero,  (no  group  factors), 

W  =  21ft,  (a  general  factor). 

Case  (2).    R  =  15ft,  B  =  15ft,  Y  =  16ft,    (specific  factors), 
P  =  47ft,  G  =  25ft,  0  =  24ft,     (group  factors), 
W  =  zero,  (no  general  factor). 

Example  B 

Three  variables,  composed  of  overlapping  dice  throws,  give  corre- 
lations as  follow: 

rxy  =  0-72,     ryz  =  0-77,     rzx  =  0-67. 

Are  any  dice  common  to  the  three  variables? 
In  this  case  we  have 

rxv2  +  rv?  +  rz*  +  2rxyryzrzx  =  1-93,   i.e.  >  1. 

We  conclude  therefore  that  some  white  dice  common  to  the  three 
variables  are  present,  i.e.  that  there  is  a  general  factor.  The  following 
arrangements  of  dice  do  actually  produce  these  correlations,  the  general 
factor  being  a  minimum  in  one  and  a  maximum  in  the  other. 

Case  (1).    R  =  156ft,  B  =  104ft,  Y  =  52ft,    (specific  factors), 
P  =  G  =  0  =  zero,  (no  group  factors), 

W  =  260ft,  (general  factor). 

Case  (2).    R  =  B  =  Y  =  zero,  (no  specific  factors), 

P  =  90ft,  G  =  161ft,  0  =  123ft,     (group  factors), 
W  =  198ft,  (general  factor). 

We  see  then  that  if 

the  presence  of  some  white  dice  is  certain.  If  the  above  quantity,  which 
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we  shall  call  D,  is  equal  to  or  less  than  unity,  the  presence  of  white  dice 
is  uncertain.  Suppose  we  consider  two  cases  in  which 

rw  =  0-8,     rw  =  0-4,     r2X  =  0-l,     D  =  0-842, 

and  r^  =  0-2,     ryz  =  0-2,     rzx  =  0-1,     D  =  0-094,  respectively. 

Can  we  in  these  two  cases  say  anything  as  to  the  probability  of  the 
existence  of  some  general  factor? 

The  answer  to  this  question  is  twofold.  If  we  suppose  that  the  person 
making  the  arrangements  of  dice  has,  among  all  the  possible  arrange- 
ments giving 

^  =  0-8,     ^  =  0-4,     rzx  =  Q-l,     D  =  0-842, 

chosen  one  by  chance  selection,  and  if  we  suppose  that  the  other 
arrangement  giving 

rw  =  0-2,     rvz  =  0-2,     rzx  =  0-l,     D  =  0-094 

has  similarly  been  chosen  by  chance  selection,  then  it  is  much  more 
probable  that  a  general  factor  exists  in  the  first  than  in  the  second  case. 
This  probability  will  in  fact  rise  and  fall  with  D  though  it  is  not  measured 
byZ). 

But  if  the  person  making  the  arrangements  of  dice  has  any  definite 
rules  which  he  follows  in  making  the  patterns,  then  the  above  probability 
will  have  much  less  meaning.  In  the  former  instance  it  would  express 
the  average  to  be  found  in  many  cases;  in  the  latter  it  will  no  longer 
express  even  such  an  average  value,  and  will  have  no  practical  worth. 

Before  leaving  for  the  present  the  subject  of  dice  throws  two  points 
may  be  mentioned.  (1)  Negative  correlations  may  be  imitated  by  dice 
being  added  to  one,  but  subtracted  from  the  other,  variable*.  (2)  There 
are  many  ways  conceivable  in  which  correlations  can  be  produced  other 
than  by  common  factors.  Consider  for  example  the  positive  correlation 
between  the  number  of  hearts  in  my  hand  and  the  number  of  spades 
in  my  partner's,  at  whist.  Doubtless  the  correlations  found  in  mental 
phenomena  are  in  many  instances  still  more  subtle  in  origin. 

(4)  MULTIPLE  CORRELATION 

The  full  significance  of  correlation  is  only  to  be  realised  after  a  careful 
study  of  the  general  theory  of  correlation  of  numerous  variables,  of 
which  the  correlation  of  two  variables,  measured  by  the  correlation 
coefficient  r,  is  only  a  particular  case.  The  classic  memoir  on  the  subject 
is  that  by  Professor  Karl  Pearson  on  "Regression,  Panmixia  and 

*  See  J.  R.  Thompson,  "The  R61e  of  Interference  Factors  in  Producing  Correlation," 
Brit.  Journ.  of  Psychol.  191$,  x.  pp.  81 — 100. 
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Heredity,"  in  1896*,  which  is  however  too  advanced  for  quotation  at 
any  length  here.  In  1907  Mr  G.  Udny  Yule  introduced  a  new  notation| 
and  made  various  improvements,  and  his  formulae  will  now  be  briefly 
summarised. 

The  regression  equation  for  the  two  variates,  x  and  y,  both  measured 
from  their  means,  was  found  in  a  former  section  to  be  of  the  form 


the  equation  being  obtained  by  an  application  of  the  method  of  least 
squares. 

It  is  evident  that  the  method  thus  applied  to  two  variates  might, 
without  further  assumptions,  be  used  in  the  general  case  of  n  variates. 
Thus,  if  xl9  x2,  ...  xn  denote  deviations  from  means,  the  equation  ex- 
pressing the  regression  of  xl  on  x2  .  .  .  xn  could  be  written 

Xl  =  &12.34-  A  +  &13.24...  A  +   —  +  ^In^.-n-A- 

"  In  this  notation,  the  suffix  of  each  regression  coefficient  completely 
defines  it.  The  first  subscript  gives  the  dependent  variable,  the  second 
the  variable  of  which  the  given  regression  is  the  coefficient,  and  the 
subscripts  after  the  period  show  the  remaining  independent  variables 
which  enter  into  the  equation"  (Yule,  op.  dt.  p.  182). 

In  terms  of  the  same  notation,  write 

r12.34-.-n  =  (&12.34...W&21.34  —  n)*> 

Xl.23-"n  —  xl  ~  (^12.34».«^2  +  •••  ^lw.23  —  n-lxn)> 


Then  the  normal  equations,  from  which  the  regressions  are  deter- 
mined by  the  method  of  least  squares,  may  be  written 

S  («a-ai.28...n)  =  S  (z3-Zi.23...n)  =   •••  =  $  (Xn'Xl.23.~n)  =  0. 

From  these  equations  Mr  Yule  has  deduced  the  following  results: 

Z,  _  r  a1.34...n 

U12.  34  —  n  ~  '12.34  —  n  ~~  j 


_. 

(^-r\n.^...n 

r12.34...n  is  known  as  a  "partial"  correlation  coefficient,  being  the  value 
of  the  correlation  between  1  and  2  for  constant  values  of  3,  4  ...  n. 
Similarly,  bl2tM..,n  is  a  "partial"  regression  coefficient  (cf.  "partial 

*  Phil.  Trans,  cxcvm.  A,  pp.  443—459. 

t  Proc.  Boy.  Soc.  1907,  LXXIX.  A,  pp.  182—193. 

B.  &  T.  10 


H6  CORRELATION  [PT.  n 

differentiation"  in  the  Differential  Calculus).  Knowing  the  "total" 
correlations  r12,  r13,  r23,  etc.,  we  are  enabled  to  obtain  the  various  partial 
coefficientsby  successive  substitutions.  Thus,  in  the  case  of  three  variables, 
1,  2,  3,  we  have 

ri2.3=    /    _12    »Vl-r  2* 

V  1  —  ^13   VI    -  7-23 

and  two  similar  equations,  expressing  the  value  of  the  correlation  be- 
tween two  of  the  variables  for  a  constant  value  of  the  third*.  If  a 
fourth  variable  be  added,  we  have  the  further  set  of  equations 

r!2.3  ~~  ^14.3r24.3 

"•""(l-^.J*(l-HW.J»- 

Perhaps  a  more  convenient  formula  for  obtaining  the  partial  corre- 
lation of  two  variables  for  constant  values  of  a  third  and  fourth  is 


When  the  partial  correlation  coefficients  have  been  determined,  the 
regressions  can  be  found  by  substituting  in  the  appropriate  equations, 
and  give  at  once  the  regression  equations.  As  explained  before,  a 
regression  equation  gives  the  most  probable  value  of  one  variable  for 
given  values  of  the  remaining  variables,  the  standard  error  in  such  a 
prediction  being  o-1>23...n,  e^c- 

An  example  of  the  method  of  applying  the  above  formulae  is  given 
in  the  next  few  paragraphs,  where  the  partial  correlations,  regressions, 
etc.  in  the  case  of  four  interrelated  psychical  capacities  are  worked  out 
on  lines  identical  with  those  illustrated  by  Mr  Yule  in  his  paper. 

Example  of  Multiple  Correlation  f 
(Boys,  ages  11—12;  n  =  66.) 

1.  Crossing  through  two  letters  (e  and  r). 

2.  Crossing  through  four  letters  (a,  n,  o,  s). 

3.  Combination  test. 

4.  Mechanical  memory  test. 

\ 

Formula  for  multiple  correlation  : 

r!2.34...n-l  ~  rln.34...n-lr2n.34...n-l 

°(l-^.«...^(l-^.«....-i)t' 

*  For  an  interesting  representation  of  correlation  between  three  variables  by  a  model 
showing  the  distribution  of  points  in  space,  see  G.  Udny  Yule,  An  Introduction  to  the 
Theory  of  Statistics,  C.  Griffin  and  Co.,  London,  1911,  pp.  241—243. 

t  W.  Brown,  Brit.  Journ.  of  Psychol.  1910,  in.  p.  317. 
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For  four  variables  this  becomes : 

r!2.3  ~~  r!4.3r24.3 
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r!2.34  = 


(1- 


'12- 


13 '23 


(1-  fa*)*  (I- 
Table  I 


Correlation  coefficient 

log(l-r') 

12 
13 
14 
23 
24 
34 

0-78 
0-45 
0-40 
0-48 
0-29 
0-52 

1-59284 
1-90173 
1-92428 
1-88627 
1-96185 
1-86308 

Table  II 


Correlation  coefficient 
(zero  order) 

Product  term 
of  numerator 

Numerator 

Correlation  coefficient 
(first  order) 

log(l-H) 

12 
13 
23 

0-78 
0-45 
0-48 

0-2160 
0-3744 
0-3510 

0-5640 
0-0756 
0-1290 

12-3 
13-2 
23-1 

0-7199 
0-1377 
0-2308 

1-68281 
1-99169 
1-97623 

12 
14 
24 

0-78 
0-40 
0-29 

0-1160 
0-2262 
0-3120 

0-6640 
0-1738 
-0-0220 

12-4 
14-2 
24-1 

0-7570 
0-2902 
-0-0386 

1-63038 
1-96179 
1-99348 

13 
14 
34 

0-45 
0-40 
0-52 

0-2080 
0-2340 
0-1800 

0-2420 
0-1660 
0-3400 

13-4 
14-3 
34-1 

0-3091 
0-2176 
0-4154 

1-95639 
1-97893 
1-91774 

23 
24 
34 

0-48 
0-29 
0-52 

0-1508 
0-2496 
0-1392 

0-3292 
0-0404 
0-3808 

23-4 
24-3 
34-2 

0-4027 
0-0539 
0-4536 

1-92316 
1-99873 
1-89996 

Table  III 


Correlation  coefficient 
(first  order) 

Product  term 
of  numerator 

Numerator 

Correlation  coefficient 
(second  order) 

log(l-r') 

12-4 
13-4 
23-4 

0-7570 
0-3091 
0-4027 

0-1245 
0-3048 
0-2340 

0-6325 
0-0043 
0-1687 

12-34 
13-24 
23-14 

0-727 
0-007 
0-272 

1-67345 
1-99998 
1-96662 

12-3 
14-3 
24-3 

0-7199 
0-2176 
0-0539 

0-0117 
0-0388 
0-1567 

0-7082 
0-1788 
-0-1028 

12-34 
14-23 
24-13 

0-727 
0-258 
-0-152 

1-97009 
1-98985 

13-2 
14-2 
34-2 

0-1377 
0-2902 
0-4536 

0-1316 
0-0625 
0-0400 

0-0061 
0-2277 
0-4136 

13-24 
14-23 
34-12 

0-007 
0-258 
0-436 

1-90843 

23-1 
24-1 
34-1 

0-2308 
-0-0386 
0-4154 

-0-0160 
0-0959 
-0-0089 

0-2468 
-0-1345 
0-4243 

23-14 
24-13 
34-12 

0-272 
-0-152 
0-436 

— 

10—2 
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The  regression  equation  between  changes  in  intelligence  (as  measured 
by  the  combination  test)  and  changes  in  the  other  three  variables  is 

X3  =  ^31  .  24  Xl  +  ^32  .  14  X2  +  ^34  .  12  xf 

Calculation  of  regression  coefficients. 

*!  =  68-19,     <72  =  5843,     a3  =  16-34,     <r4  =  9-70, 

^1.23...n  =  *f  (1  -  /122)  (1  -  r2l3.2)  (1  -  fV«)  &  (1  ~  *\n  .  23  ...n-l)- 
•'.    *a.M=*s(l  -'»*)*(!  -^.i..)* 

=  12-77. 
Similarly, 

°\.  24=40-83,    (73.14  =  13-27,    <72<14=36-27,    (J3.12  =  14-23,    a4<12  =  8-82. 

Again, 


<72.34...n 
3.24 


- 

.  24  ~~  '  31  .  24 


Similarly,  632  .  14  =  -099,     b^  .  12  =  -703. 

Hence  regression  equation  is 

x3  =  '002X!  +  '099x2  +  '703x4  . 

The  standard  error  (cj3>124)  made  in  estimating  x3  from  xv  x2  and  «4 
by  this  equation 

=  C73  (1  -  f312)*(l  -!*„.!)*  (1  -l^at.lt)* 

=  12-78. 

(5)  SPURIOUS  CORRELATION 

Correlation  is  said  to  be  spurious  when  it  is  due  to  extraneous  con- 
ditions and  does  not  arise  directly  out  of  the  functions  under  con- 
sideration. The  term  is  one  of  relative  and  not  of  absolute  significance, 
but  its  appropriateness  will  become  apparent  after  a  consideration  of 
the  following  two  examples: 

1  .    Heterogeneity  of  material. 

Let  us  suppose  that  two  distinct  groups  of  children  have  been 
measured  for  two  characters  A  and  B,  and  that  the  mean  abilities,  in 
both  A  and  B,  are  higher  in  the  one  group  than  in  the  other.  Then, 
even  if  there  is  no  correlation  between  the  two  characters,  as  estimated 
from  each  group  separately,  a  positive  correlation  will  be  obtained  from 
the  two  groups  taken  together.  On  the  other  hand,  if  the  mean  ability 
in  A  is  higher,  and  the  mean  ability  in  B  lower,  in  the  one  group  than 
in  the  other,  a  negative  correlation  will  be  obtained  by  taking  the  two 
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groups  together.  The  correlation  in  each  case  will  be  due  simply  to  the 
heterogeneity  of  the  material  employed.  The  difference  in  the  mean 
values  for  the  two  groups  must  of  course  have  some  cause,  such  as  a 
difference  of  nationality,  sex,  or  even  locality  (within  any  one  town  or 
district)  from  which  the  children  are  drawn;  or,  again,  such  a  cause  as 
the  difference  of  discipline  to  which  the  two  groups  have  been  subjected 
in  the  past.  These  are  extraneous  conditions  and,  if  measurable,  can 
be  allowed  for  by  employing  the  method  of  partial  correlation.  As  a 
rule,  however,  they  are  not  easy  to  determine  quantitatively;  hence 
their  dangerous  character. 

2.    Index  Correlation*. 

This  is  a  form  of  spurious  correlation  which  arises  from  the  use  of 
ratios  for  measurements.   Thus  if 

x-,  x» 

M       __i.        y        & 

Zl~  r    >Z2~  „    > 
X3  X3 

and  a^,  x2,  x3  are  uncorrelated  with  one  another,  it  can  be  shown  that 
the  correlation  between  z±  and  z2,  is 


a  quantity  which  may  be  as  large  as  *5. 

As  an  illustration  from  psychology,  we  may  mention  the  correlation 
of  errors  of  observation,  either  of  different  individuals  or  of  the  same 
individual  at  different  times.  Even  if  the  absolute  errors  are  entirely 
uncorrelated,  the  percentage  errors  (errors  expressed  as  percentages  or 
other  fractions  of  the  observed  quantity)  will  often  show  a  large  amount 
of  correlation.  In  such  a  case  as  this,  the  important  question  arises 
whether  we  are,  from  other  considerations,  justified  in  using  ratios  (or 
indices)  rather  than  absolute  measures.  If  we  are,  the  index  correlation 
is  not  to  be  regarded  as  spuriousf. 

*  Karl  Pearson,  "On  a  form  of  Spurious  Correlation  which  may  arise  when  Indices 
are  used  in  the  Measurement  of  Organs,"  Proc.  Hoy.  Soc.  1897,  LX.  p.  489.  In  a  note 
following  Pearson's  paper  Sir  Francis  Galton  illustrates  the  occurrence  of  index  correlation 
by  a  simple  and  illuminating  example. 

f  Spurious  correlation  of  this  (as  of  any)  kind  can  be  eliminated  by  partial  corre- 
lation. 
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(6)  VARIATE  DIFFERENCE  CORRELATION,  OR  THE  ELIMINATION  OF 
SPURIOUS  CORRELATION  DUE  TO  POSITION  IN  SPACE  AND  TIME* 

This  is  a  method  for  determining  the  correlation  of  variations  from 
the  "instantaneous  mean,"  by  correlating  corresponding  differences 
between  successive  values.  If  two  variates  x  and  y  are  such  that 

x  =  <£  (t)  +  X, 


where  X  and  Y  are  the  parts  of  x  and  y  independent  of  the  time  t,  then 
it  can  be  shown  that  on  certain  not  unreasonable  assumptions, 


if  m  is  large  enough,  where  mDx  is  the  mth  difference  between  successive 
values  of  x,  and  similarly  mDy.  For  example  : 


47 

6 
53  -4 

2  4 

55  0  -5 

2  -1 

57  -1 

1 
58 

Clearly  the  number  of  corresponding  cases  to  be  correlated  is  reduced 
each  time.  The  method  is  only  valid,  inter  alia,  when  there  are  so  many 
cases  in  the  x  column  that  this  reduction  is  immaterial.  The  following 
numerical  example  will  it  is  hoped  serve  to  make  clear  any  obscure 
points  in  the  above  necessarily  short  account.  The  correlations  between 
differences  are  worked  out  until  r  remains  steady  for  several  successive 
differences. 

Numerical  illustration  of  the  Variate  Difference  Correlation  Method^ 

The  numbers  headed  Savings  and  Tobacco  respectively  are  "indices" 
from  Professor  Georgio  Mortara's  article  in  the  Giornale  degli  Economiste 
e  Rivista  di  Statistica,  February  1914.  As  they  stand,  they  include  a 
continuous  secular  increase  both  of  savings  and  of  consumption  of 
tobacco  which  has  occurred  in  Italy  during  the  period  in  question,  and 
their  correlation  is  -984.  But  first  differences  only  correlate  to  an  extent 
•766  as  is  shown  in  detail  in  the  following  working: 

*  Miss  F.  E.  Cave,  Proc.  Roy.  Soc.  1904,  LXXIV.  p.  407;  R.  H.  Hooker,  Journ.  Ray. 
Stat.  Soc.  1905,  LXVIII.  p.  396;  "Student,"  Biometrika,  1914—15,  x.  p.  179;  and  Anderson, 
Biometrika,  1914  —  15,  x.  p.  269,  where  probable  errors  are  given. 

t  From  Beatrice  M.  Cave  and  Karl  Pearson,  Biometrika,  1914—15,  x.  p.  340. 
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Year 

Savings 

X 

A 

From 
5 

Sq. 

Product 

Sq. 

From 
3 

A 

Tobacco 

y 

1885 

47 

82 

6 

1 

1 

1 

1 

1 

4 

1886 

53 

86 

2 

-3 

9 

3 

1 

-1 

2 

1887 

55 

88 

2 

-3 

9 

15 

25 

-5 

-2 

1888 

57 

86 

1 

-4 

16 

12 

9 

-3 

0 

1889 

58 

86 

1 

-4 

16 

8 

4 

-2 

1 

1890 

59 

87 

1 

-4 

16 

8 

4 

-2 

1 

1891 

60 

88 

4 

-1 

1 

2 

4 

-2 

1 

1892 

64 

89 

1 

-4 

16 

8 

4 

-2 

1 

1893 

65 

90 

0 

-5 

25 

20 

16 

-4 

-1 

1894 

65 

89 

3 

-2 

4 

8 

16 

-4 

-1 

1895 

68 

88 

2 

-3 

9 

9 

9 

-3 

0 

1896 

70 

88 

3 

-2 

4 

8 

16 

-4 

-1 

1897 

73 

87 

2 

-3 

9 

3 

1 

-1 

2 

1898 

75 

89 

4 

-1 

1 

1 

1 

-1 

2 

1899 

79 

91 

4 

-1 

1 

2 

4 

-2 

1 

1900 

83 

92 

4 

-1 

1 

0 

0 

0 

3 

1901 

87 

95 

5 

0 

0 

0 

1 

-1 

2 

1902 

92 

97 

7 

2 

4 

-2 

1 

-1 

2 

1903 

99 

99 

8 

3 

9 

0 

0 

0 

3 

1904 

107 

102 

8 

3 

9 

3 

1 

1 

4 

1905 

115 

106 

12 

7 

49 

0 

0 

0 

3 

1906 

127 

109 

17 

12 

144 

36 

9 

3 

6 

1907 

144 

115 

11 

6 

36 

36 

36 

6 

9 

1908 

155 

124 

13 

8 

64 

32 

16 

4 

7 

1909 

168 

131 

12 

7 

49 

28 

16 

4 

7 

1910 

180 

138 

7 

2 

4 

6 

9 

3 

6 

1911 

187 

144 

5 

0 

0 

0 

16 

4 

7 

1912 

192 

151 

145 

506 

247 

220 

69 
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In  the  table  the  variate  x  is  the  Savings  Index,  y  is  the  Tobacco 
Index.  We  then  have 

Mean  of  J)x  =  145/27  =  5-37, 
Mean  of  A  =    69/27  =  2-56. 

Take  5  and  3  as  provisional  centres,  so  that  d1  =  -37,  d2  —  —  -44. 
From  deviations  from  these  points  we  get,  as  shown  in  the  table, 

S  (A2)  =  506, 

S  (A  A)  =  247, 

S  (iPS)  =  220. 

Therefore  the  correlation  (correcting  for  d±  and  d2)  is 

f. ?^±» =  .77 

V(506  -  4)  V(220  -  5) 

Second  differences  have  a  small  negative  correlation,  which  increases 
till  with  sixth  differences  we  reach  —  -431,  which  seems  to  indicate  that, 
when  time  has  been  eliminated,  expenditure  on  tobacco  in  any  year 
means  less  money  saved. 


CHAPTER  VIII 

THE  CORRECTION  OF  RAW  CORRELATION  COEFFICIENTS 

Historical  account — The  elimination  of  irrelevant  factors — Reliability  coefficients — 
Correction  for  observational  errors. 

(1)  HISTORICAL  ACCOUNT 

THE  history  of  the  use  of  the  theory  of  correlation  in  Psychology  can 
hardly  be  said  to  have  begun  earlier  than  the  commencement  of  the 
present  century.  During  the  previous  twenty  years,  indeed,  a  great  deal 
of  work  had  been  done  by  many  observers  in  measuring  simple  mental 
abilities  (by  the  "mental  test"  method)  in  larger  or  smaller  groups  of 
subjects,  and  attempts  had  even  been  made  to  determine  in  what  way 
these  abilities  were  related  to  one  another  and  to  more  general  mental 
ability,  or  "general  intelligence." 

Owing,  however,  to  a  universal  lack  of  knowledge  of  the  mathe- 
matical theory  of  correlation  among  psychologists  during  this  period, 
the  results  were  not  obtained  in  a  form  suitable  for  comparison  with 
one  another,  so  that  it  is  not  surprising  to  find  that  they  hopelessly 
contradict  one  another.  The  heterogeneity  of  the  material  worked 
with,  the  non-elimination  of  irrelevant  factors,  and  the  absence  of  any 
measure  of  the  "probable  error"  of  the  results  make  the  conclusions 
drawn  by  the  investigators  themselves  from  their  researches  utterly 
unreliable. 

The  first  investigation  showing  any  mathematical  precision  was  that 
published  by  Clark  Wissler*  in  1901.  It  contained  (inter  alia)  an  account 
of  the  careful  application  of  a  large  number  of  simple  mental  tests  upon 
over  200  college  students,  and  a  correlation  of  the  results  with  one 
another  and  with  the  students'  marks  in  the  various  subjects  of  the 
college  curriculum.  The  mental  tests  were  found  to  correlate  but  slightly 
with  one  another  or  with  ability  in  college  subjects  of  study,  though 
these  latter  showed  considerable  correlation  with  one  another  («30 — 75). 

In  the  following  year  Aikens  and  Thorndikef  published  results  which 

*  Clark  Wissler,    "The  Correlation  of    Mental  and  Physical  Tests,"   Psychological 
Review,  Monograph  Supplement,  m.  No.  16,  June  1901. 

|  "Correlations  among  Perceptive  and  Associative  Processes,"  Psychological  Review,  EC. 
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were  in  a  sense  confirmatory  of  those  of  Wissler,  since,  notwithstanding 
the  greater  similarity  to  one  another  of  the  functions  investigated  than 
in  Wissler's  research,  the  correlations  were  again  found  to  be  low.  For 
example,  different  tests  devised  for  the  measurement  of  "speed  of 
association  "  were  found  to  show  hardly  any  correlation — a  result  which 
seemed  to  furnish  some  justification  for  the  author's  statement  that 
"quickness  of  association  as  an  ability  determining  the  speed  of  all  one's 
associations  is  a  myth"  (op.  tit.  p.  375).  A  similar  lack  of  close  relation- 
ship was  found  in  the  case  of  other  mental  functions  which  would,  on 
the  evidence  of  introspection  alone,  be  confidently  classed  as  particular 
instances  of  the  same  general  mental  function. 

In  1904  there  appeared  an  epoch-making  article  by  Professor  C. 
Spearman*,  the  ideas  originating  in  which  have,  at  least  in  England, 
ever  since  dominated  correlational  work  in  its  applications  to  psychology. 
Since  we  shall  have  frequent  occasion  in  the  course  of  this  and  the 
succeeding  chapter  to  take  exception  to  Professor  Spearman's  theories 
and  mathematical  methods,  which  appear  to  us  incorrect  and  harmful, 
we  may  perhaps  be  allowed  at  this  point,  before  embarking  on  contro- 
versial matters,  to  express  our  opinion  that  only  Professor  Spearman's 
enthusiasm  and  originality  could  have  given  to  psychological  correlation 
research  the  life  and  activity  which  it  has  shown  during  the  last  fifteen 
years.  His  work  has  stirred  up  both  disciples  and  opponents  to  investi- 
gations which  would  otherwise  never  have  occurred  to  them. 

The  new  ideas  in  question  fall  into  two  main  groups. 

(1)  Corrections  to  the  raw  values  of  correlation  coefficients.  Instead  of 
measuring  large  numbers  of  individuals,  as  his  predecessors  had  done, 
Professor  Spearman  contented  himself  with  small  numbers,  groups  of 
less  than  40  in  his  first  research,  and  as  few  as  11  in  the  second;  but  he 
proposes  to   make  up  for  the   unreliability  thus  introduced  into  the 
results  by  a  more  careful  measurement  of  his  cases,  and  the  application 
of  "corrections"  to  the  "raw"  values  of  his  correlation  coefficients  by 
means  of  appropriate  mathematical  formulae. 

(2)  The  discovery  of  "  hierarchical "  order  among  correlation  coefficients, 
and  the  Theory  of  General  Ability,  or  the  Theory  of  Two  Factors,  which 
has  been  built  up  on  this  foundation.  This  theory  has  been  a  great  incentive 
to  research,  and  may  possibly  correspond  to  the  facts,  though  we  do 
not  incline  to  think  so.    But  its  deduction  from  the  occurrence  of 
"hierarchical"  order  among  the  correlation  coefficients  is  invalid,  as 

*  "General  Intelligence  objectively  determined  and  measured,"  Amer.   Journ.   of 
Psychol  xv.  pp.  201—292. 
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will  be  shown  in  the  next  chapter.  In  the  present  chapter  we  turn  to 
the  closer  consideration  of  the  first  group  of  ideas,  the  correction  of  raw 
correlation  coefficients. 


(2)  THE  ELIMINATION  OF  IRRELEVANT  FACTORS 

The  first  kind  of  correction  is  that  for  the  elimination  of  irrelevant 
factors,  and  is  nothing  new  in  the  theory  of  correlation,  being  simply 
the  method  of  partial  correlation  described  in  the  last  chapter. 

For  example,  to  eliminate  the  effects  of  difference  of  age  in  the  group 
experimented  upon,  one  would  determine  the  partial  correlation  between 
the  two  characters  under  consideration,  for  "age  constant,"  by  means 
of  the  Yulean  formula  given  on  p.  137.  A  similar  procedure  is  needed 
for  eliminating  the  effects  of  difference  of  sex,  etc.  A  preferable  course, 
however,  would  be  to  dispense  as  far  as  possible  with  the  necessity  for 
such  corrections  by  selecting  groups  of  individuals  of  the  same  age, 
sex,  etc.  Indeed,  as  Professor  Spearman  very  properly  points  out*, 
the  partial  correlation  formula  must  on  no  account  be  used  in  cases 
where  there  is  too  violent  heterogeneity  of  the  irrelevant  factor.  For 
example,  we  might  with  justice  use  it  to  eliminate  the  effects  of  age  in 
a  group  where  the  extreme  differences  of  age  were  only  over  a  range  of 
two  or  three  years,  but  not  in  a  group  where  the  subjects  ranged  from 
say  five  years  to  fifteen  years  of  age. 

(3)  RELIABILITY  COEFFICIENTS 

The  second  kind  of  correction  introduced  by  Professor  Spearman  is 
a  correction  for  what  he  calls  "accidental"  errors.  This  correction  is 
based  on  the  "reliability  coefficients"  introduced  by  Professor  Spearman, 
and  as  these  reliability  coefficients  are  in  themselves  a  very  useful  con- 
ception quite  apart  from  their  employment  in  this  connection,  we  take 
this  opportunity  of  explaining  them.  *• 

Anyone  who  has  carried  out  'psychological  experiments,  or  even  an 
ordinary  examination  j^pn.  the  same  subjects  with  similar  tests  on  two 
or  more  different  o*ccJPsions  Aoes  not  nee<y*>  be  reminded\tjiat  the  results 
will  differ,  sometimes  vfe^f  decidedly.  The  order  of  merit  of  the  children 
in  a  test  to-day  will  not  be  the  same  as  their  order  of  merit  in  a  closely 
similar  test  to-mori?ow.  Unless  however  the  differences  are  only  slight, 
it  is  clear  that  the  test  or  examination  is  of  no  practical  use.  Its  re- 

*  "Demonstration  of  Formulae  for  the  True  Measurement  of  Correlation,"  Amer. 
Journ.  of  Psychol.  1907,  xvm.  especially  p.  166. 
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liability  can  be  conveniently  measured  by  the  correlation  coefficient  of 
the  marks  obtained  on  the  two  different  occasions.  Such  a  correlation 
coefficient  is  called  a  reliability  coefficient.  In  practice  tests  which  give 
reliability  coefficients  lower  than  0-6  are  useless,  and  the  ideal  would 
be  a  great  deal  higher  than  this. 


(4)  CORRECTION  FOR  OBSERVATIONAL  ERRORS 

It  is  by  the  aid  of  these  reliability  coefficients,  as  we  have  said,  that 
Professor  Spearman  carries  out  the  calculations  which  have  for  their 
object  the  elimination  of  observational  errors. 

It  is  clear  that  if  the  correlation  between  two  series  of  quantities  is 
really  perfect,  then  any  observational  errors  made  in  measuring  these 
quantities  can  only,  and  will  actually,  reduce  the  correlation.  The 
amount  of  this  reduction  will  probably  be  greater,  the  greater  are  the 
observational  errors.  The  size  of  these  is  however  indicated  to  some 
extent  by  the  reliability  coefficients,  so  that  it  is  but  a  short  step  to  use 
these  to  correct  for  the  reduction,  and  enlarge  the  correlation  coefficient 
to  its  true  value.  An  example  given  by  Professor  Spearman  himself* 
shows  this  so  clearly  that  it  is  not  out  of  place  to  repeat  it  here. 

"A  target  was  constructed  of  a  great  many  horizontal  bands, 
numbered  from  top  to  bottom.  Then  a  man  shot  successively  at  a 
particular  series  of  numbers  in  a  particular  order.  Clearly,  the  better 
the  shot,  the  less  numerical  difference  between  any  number  hit  and  that 
aimed  at;  now,  just  as  the  measurement  of  any  object  is  quite  appro- 
priately termed  a  '  shot '  at  its  real  value,  so,  conversely,  we  may  perfectly 
well  consider  the  series  of  numbers  actually  hit  in  the  light  of  a  series 
of  measurements  of  the  numbers  aimed  at.  When  the  same  man  again 
fired  at  the  same  series,  he  thereby  obtained  a  new  and  independent 
series  of  measurements  of  the  same  objects.  Next,  a  woman  had  the 
same  number  of  shots  at  some  set  numbers  in  a  similar  manner.  If, 
then,  our  above  reasoning  and  formulas  (s"ee  below)  are  correct,  it  should 
be  possible,  by  observing  the  numbers  hit  and  working  out  their  corre- 
lations, to  ascertain  the  exact  resemblance  between  the  series  aimed  at 
by  the  man  and  the  woman  respectively.  In  actual  fact,  the  series  of 
numbers  hit  by  the  man  turned  out  to  correlate  with  those  hit  by  the 
woman  to  the  extent  of  0-52;  but  it  was  noticed  that  the  man's  sets 
correlated  with  one  another  to  0-74,  and  the  woman's  sets  with  one 
another  to  0-36;  hence  the  true  correspondence  between  the  set  aimed 

*  Amer.  Journ.  of  Paychol  1904,  xv.  p.  271. 
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at  by  the  man  and  that  aimed  at  by  the  woman  was  not  the  raw  0-52,  but 

0-52 

*  V(074  x  0-36) 

that  is  to  say,  the  two  persons  had  fired  at  exactly  the  same  series  of 
bands,  which  was  really  the  case." 

It  will  be  seen  that  the  formula  for  correction  is  "divide  the  raw 
correlation  by  the  geometrical  mean  of  the  two  reliability  coefficients." 
This  formula,  or  rather  the  more  accurate  formula  of  which  it  is  an 
approximate  form,  we  shall  prove  presently.  Meanwhile  some  general 
remarks  on  it  may  be  made.  Clearly,  the  formula  will  always  increase 
the  raw  correlation,  supposing,  as  is  in  practice  usually  the  case,  that 
the  correlations  concerned  are  all  positive.  As  we  have  seen,  it  can  at 
once  be  admitted  that  a  perfect  correlation  will  of  course  be  reduced  by 
observational  errors,  so  that  the  raw  correlation  will  need  an  increase. 
Similarly,  it  may  on  common-sense  grounds  be  allowed  that  even  when 
the  true  correlation  is  not  perfect  but  only  very  high,  the  same  will  be 
true,  but  with  less  and  less  certainty  as  the  correlation  is  really  less  and 
less  perfect. 

In  the  example  given,  the  true  correlation  was  really  perfect,  and 
therefore  it  is  clear  that  any  upward  correction  will  be  an  improvement, 
provided  that  it  does  not  go  past  unity.  But  suppose  we  take  a  case  where 
the  true  correlation  is  not  unity,  but  some  less  amount.  The  following 
is  an  actual  trial.  Again  two  persons  fired  at  a  target,  a  man  and  a 
woman.  The  correlation  between  the  series  of  numbers  aimed  at  by 
the  man  and  that  aimed  at  by  the  woman,  was  0-74.  The  measured 
correlation  however  between  the  series  really  struck  by  the  man  and  that 
really  struck  by  the  woman  was  0*65.  The  reliability  coefficients  were, 
for  the  man,  0-69  and  for  the  woman,  0*79.  The  corrected  value  of  the 
measured  correlation  is  therefore 

-  °'65  _  0.88 
V(0-69  x  0-79) 

which  overshoots  the  mark,  being  0*14  greater  than  the  true  value, 
whereas  the  uncorrected  value  was  only  0-09  too  small. 

The  fact  is,  that  the  assumptions  underlying  Professor  Spearman's  for- 
mula are  never  fulfilled  in  practice.  What  these  assumptions  are  can  best 
be  seen  from  the  elegant  proof  given  for  the  formula  by  Mr  Udny  Yule*. 

*  See  Appendix  (e)  to  Professor  Spearman's  article,  Brit,  Journ.  of  Psychol.  1910,  m. 
p.  294.  There  is  a  printer's  mistake  of  omission  in  the  last  formula.  See  also  W.  Brown, 
"Some  Experimental  Results  in  Correlation,"  Comptes  JRendus  du  VI™  Congres  Inter- 
national de  Psychologie,  Geneve,  1910,  where  the  same  proof  is  quoted. 
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x^  and  y±  are  measures  of  x  and  y  at  a  certain  series  of  measurements, 
x2  and  y2  are  measures  of  x  and  y  at  another  series  of  measurements. 

Let  xl  =  x  +  S1?     #2  =  x  +  82, 

Vi  =  y  +  €i>   y*  =  y  +  €*> 

all  terms  denoting  deviations  from  means. 

Then,  if  it  is  assumed  that  8,  €,  the  errors  of  measurement,  are 
uncorrelated  with  one  another  or  with  x  or  y, 

S  (zS)  etc.  =  0, 
8  tayi)  =  S  (xy). 

Hence  Wi0^**  =  rxyaxvy, 

and  similarly  rXiV^X2ay2  =  rxyaxayy 


or  r  *  =  r 


4  __ 

,  4 

y 


'  But  also,  since        S  (z8)  =  0,     S  (x^)  =  S  (xz), 
and  ^1x2opa;i^«2  =  t7a!2 

of  °f 

or  0x,crX9  =  -    -   and  av,aV9  =  -=-  , 


or 


»v»v» 

Geom.  Mean  of  correlation  coefficients 


_ 

w       Geom.  Mean  of  reliability  coefficients 

It  is  this  formula  which  is  employed  in  the  example  given  above  of 
shooting  at  a  target,  the  numerator  there  however  consisting  only  of 
r(xl+xt)(vi+vz>>  *^e  subcalculations  for  rXlVl  etc.  not  being  made. 

Attention  must  be  drawn  to  the  assumption  that  the  errors  of 
measurement  8  and  e  are  uncorrelated  with  each  other  or  with  x  or  y. 
^  Now,  these  are  very  large  assumptions  to  make.  Even  in  cases  where 
the  quantities  8,  €  are  genuine  errors  of  measurement,  there  are  strong 
reasons  for  assuming  (on  general  principles  and  also  from  experimental 
evidence)*  that  they  will  be  correlated.  But  in  the  case  of  almost  all 
the  simpler  mental  tests  the  quantities  8  and  e  are  not  errors  of  measure- 
ment at  all.  They  are  the  deviations  of  the  particular  performances  from 
the  hypothetical  average  performance  of  the  several  individuals  under 
consideration.  Thus  they  represent  the  variability  of  performance  of 

*  See  Karl  Pearson,  "On  the  Mathematical  Theory  of  Errors  of  Judgment,  with 
special  reference  to  the  Personal  Equation,"  Phil.  Trans,  cxcvui.  A,  pp.  235 — 299. 


CH.  vm]  CORRECTION  OF  CORRELATION  COEFFICIENTS     159 

function  within  the  individual.  When  an  individual  in  the  course  of 
three  minutes  succeeds  in  striking  through  100  e's  and  r's  in  a  page  of 
print  on  one  day,  and  94  under  the  same  conditions  a  fortnight  later, 
there  is  no  error  of  observation  involved.  T.he  numbers  100  and  94  are 
the  actual  true  measures  of  ability  on  the  two  occasions.  The  average 
or  mean  ability,  which  is  the  more  interesting  measure  for  the  purposes 
of  correlation,  is  doubtless  different  from  either,  but  that  does  not  make 
the  other  two  measures  erroneous.  Evidently  in  these  cases  8  and  e 
represent  individual  variability,  and  to  assume  them  uncorrelated  with 
one  another  or  with  the  mean  values  of  the  functions  is  to  indulge  in 
somewhat  a  priori  reasoning. 

There  are  two  comparatively  simple  ways  of  testing  the  assumption  : 

(1)   8  (xiyi)  =  8  (xy)  =  8  (x^2), 

.'.  S  (x-^y-^  —  S  (x2y2)  should  =  0  within  the  limits  of  the  probable 
error  of  the  difference. 

Brown  has  applied  this  test  to  the  case  of  correlation  between 
accuracy  in  bisecting  lines  and  accuracy  in  trisecting  them  in  43  adult 
subjects. 

Here  S  (bjj  -  S  (b2t2)  =  137780  -  60036 

=  77744, 


P.E.  of  8  (xy)  =  -67449  ^/Pz2  ~  P™P°2  ,  in  Pearson's  notation, 

=  -67449     /S  (xyY      S  (a?)  S  (y*) 
^n    V  "     n  n2 

P.E.  of  S  (b^)  =  687;  P.E.  of  8  (b&)  =  365; 


/.   P.E.  of  S  (Vi)  -  S  (6ay  =  V6872  +  3652 

=  778. 

Since  778  is  less  than  one-third  of  77744,  the  formula  cannot  be 
employed  to  obtain  the  correlation  between  mean  abilities  in  bisecting 
and  trisecting  lines. 

S  {(x-.  —  xz)  (y-,  —  2/0)} 
(2)    TX..X.- 


=  0, 
if  errors  are  uncorrelated  with  one  another  (since  numerator  then  =  0). 
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Applying  this  test  to  the  same  case  of  bisection  and  trisection, 
Brown  gets 

rBl-Bt  =  0-30  ±  0-09, 

Ti-Ti 

which  proves  once  more  the  inapplicability  of  the  formula. 

Brown  applied  test  (2)  also  to  the  case  of  correlation  between  speed 
of  addition  of  figures  and  accuracy  of  addition  in  a  group  of  38  school- 
children (girls  between  the  ages  of  11  and  12)  and  found 

rSl-s>  =  0-35  ±  0-09. 

Ai—At 

Even  when  test  (2)  does  give  the  value  0,  we  can  only  conclude  from 
this  that 

8  (81*1)  +  S  (82e2)  -  S  (8l68)  +  S  (8a€l). 

Replying  to  these  and  other  criticisms  of  his  formula  for  eliminating 
observational  errors,  Professor  Spearman*  admits  that  many  forms  of 
error  will  be  correlated  with  each  other  and  with  the  true  values  of  the 
quantities  measured.  But  these,  he  says,  are  generally  of  a  continuously 
progressive  nature,  and  he  proposes  to  eliminate  their  influence  by 
making  at  least  three  measurements  of  x,  and  taking  the  first  and  third 
of  these  together  as  the  xl  of  the  formula,  the  middle  one  as  x2;  or  to 
use  more  complicated  but  essentially  similar  devices.  No  doubt,  of 
course,  this  is  a  wise  precaution  to  take,  even  though  sceptics  may  still 
doubt  whether  the  remaining  so-called  "accidental"  variations  are  even 
yet  uncorrelated  with  each  other  and  with  x  and  y. 

Brownf,  using  experimental  data  obtained  by  two  independent 
observers  estimating  the  lengths  of  lines,  found  a  considerable  corre- 
lation ratio  between  the  errors  of  observation  and  the  true  lengths  of 
the  lines,  the  regression  being  very  far  from  linear.  He  is  of  opinion 
that  "no  assumptions  as  to  the  correlation  or  non-correlation  of  such 
deviations  are  in  the  least  justified." 

The  correlation  table  between  lengths  and  errors,  and  the  corre- 
sponding regression  curves,  are  shown  in  the  following  table  and  dia- 
gramj. 

In  a  review  of  Brown's  article  Mr  J.  R.  Wilton  §  has  made  some 
ingenious  suggestions  for  weighting  the  different  sc's  in  a  manner  sug- 
gested by  quadrature  formulae.  He  remarks  also  that  a  grouping 
should  be  sought  which  would  as  far  as  possible  satisfy  the  assumptions. 


*  Brit.  J&urn.  Paychol.  1910,  m.  p.  271.          f  Brit-  Jwrn.  Psychol.  1913,  vi.  p.  223. 
%  From  Brit.  J&urn.  Psychol.  1913,  vi.  pp.  236—8. 
§  Journ.  of  Exp.  Pedag.  1914,  n.  p.  302. 
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It  may  finally  be  pointed  out  that  even  if  the  errors  8  and  e  were 
known  with  certainty  to  be  uncorrelated  with  each  other  and  with  the  true 
values,  yet,  with  such  small  numbers  of  cases  as  are  used  in  many  of  the 
psychological  researches  in  which  Professor  Spearman*  s  formula  has  been 
employed,  the  chance  of,  e.g.  S  (x^y-^  being  nearly  equal  to  S  (x^/2)  is 
exceedingly  small,  and  it  is  difficult  to  attach  any  meaning  to  an  artificial, 
post  hoc  separation  of  the  data  into  halves  such  that  this  condition  is 
satisfied  (and  also  others).  The  formula  is  at  any  rate  inapplicable  to 
samples  such  as  30,  24,  52,  which  have  been  freely  used  in  experimental 
work ;  or  if  used  at  all,  it  can  only  be  as  a  guide  to  the  sufficiency  of  the 
sample.  It  is  an  essential  of  good  work  to  use  such  samples  that  correc- 
tions to  the  raw  values  obtained  are  unimportant. 

Regression  Curves  of  Correlation  Table* 

— >-     Errors  made,  in  mm. 

-21    -19- -17. .-IS    -13._r1.1_!*_tT_-6_-3    .-1      +1 


Fig.  27 
*  See  pp.  162—3. 
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CHAPTER  IX 

THE  THEOEY  OF  GENERAL  ABILITY 

This  apparent  unity  is  illusory.  Man,  in  fact,  is  a  microcosm  as  complex  as  the 
world  which  is  mirrored  in  his  mind;  he  is  a  federation  incompletely  centralised,  a 
hierarchy  of  numerous  and  conflicting  passions,  each  of  which  has  ends  of  its  own,  and 
each  of  which,  separately  considered,  would  give  a  different  law  of  conduct.  He  is  in 
some  sense  a  unit,  but  his  unity  is  such  as  to  include  an  indefinite  number  of  partly 
independent  sensibilities. 

LESLIE  STEPHEN,  The  Science  of  Ethics,  p.  69. 

Discovery  of  "Hierarchical"  order  among  correlation  coefficients — Use  of  the 
formula  for  the  correction  of  observational  errors  to  prove  the  existence  of  a 
general  factor — Researches  between  1904  and  1912 — A  criterion  for  hierarchical 
order  applied  to  numerous  researches — Complications  in  the  original  theory. 

(1)  DISCOVERY  OF  HIERARCHICAL  ORDER  AMONG  CORRELATION 

COEFFICIENTS* 

THE  controversy  as  to  whether  ability  in  any  individual  is  general,  or 
specific,  or  in  groups  or  "faculties "  is  a  very  old  one,  but  for  the  purposes 
of  the  present  chapter  it  is  not  necessary  to  go  back  prior  to  1904,  in 
which  year  there  was  published  the  first  f  of  a  series  of  articles  in  which 
Professor  C.  Spearman  has  developed  his  Theory  of  General  Ability,  or 
Theory  of  Two  Factors,  as  it  is  alternatively  named. 

Professor  Spearman's  method  in  that  paper  was  to  measure  a  number 
of  mental  abilities,  some  of  them  school  subjects,  others  artificial  tests, 
in  a  number  of  persons,  and  calculate  the  correlation  coefficients  of  each 
of  these  activities  with  each  of  the  others.  These  correlation  coefficients, 
he  then  noticed,  had  a  certain  relationship  among  themselves,  a  relation- 
ship which  may  be  called  hierarchical  order,  and  is  explained  in  detail 
later.  He  saw,  quite  rightly,  that  the  presence  of  a  general  factor  would 
produce  this  hierarchical  order  among  the  coefficients,  and,  reversing 
this  argument,  he  concluded  that  the  presence  of  hierarchical  order 
proved  the  existence  of  a  general  factor. 

In  this  first  series  of  investigations  Professor  Spearman  used  the 
following  groups  of  subjects:  24  village  school-children  of  both  sexes, 

*  Sections  1,  3  and  5  of  this  chapter  are  largely  extracts  from  an  article  by  G.  H. 
Thomson  in  the  Psychological  Review,  1920,  xxvn.  p.  173. 

f  "General  Intelligence  objectively  determined  and  measured,"  C.  Spearman,  Amer. 
Journ.  Psychol  1904,  xv.  pp.  201—293. 
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age  limits  10-0  to  13-10;  23  boys  of  a  high  class  preparatory  school, 
age  limits  9-5  —  13-7;  and  27  adults  of  both  sexes,  age  limits  21  —  78. 
The  tests  employed  were  those  for  pitch  discrimination,  weight  dis- 
crimination, and  discrimination  of  light  intensities;  and  measures  of 
intelligence  were  obtained,  in  the  case  of  the  children,  from  results  of 
school  examinations,  grading  by  teachers,  and  grading  of  one  another 
by  the  children  themselves  (measure  of  common  sense). 

The  various  school  subjects  in  the  preparatory  school  were  found  to 
correlate  highly  with  one  another,  and  when,  with  the  inclusion  of  pitch 
discrimination  and  music,  they  were  arranged  in  rows  and  columns,  it 
was  found  possible  to  place  them  in  such  an  order  that  the  correlation 
coefficients  formed  a  hierarchy,  each  being  (with  very  few  exceptions) 
greater  than  any  to  the  right  of  it  in  the  same  row,  or  below  it  in  the 
same  column,  thus 

Classics        French        English         Math.        Discrim.        Music 
Classics  ......  0-83  0-78  0-70  0-66  0-63 

French   ......         0-83  0-67  0-67  0-65  0-57 

English  ......        0-78  0-67  0-64  0-54  0-51 

Math  .......        0-70  0-67  0-64  0-45  0-51 

Discrim  .......         0-66  0-65  0-54  0-45  0-40 

Music      ......        0-63  0-57  0-51  0-51  0-40 

This  fact  of  "hierarchical"  order  which  he  had  thus  discovered  was 
taken  by  Professor  Spearman  to  indicate  the  presence  of  some  common 
fundamental  function  which  saturates  in  different  degrees  the  different 
activities,  and  is  the  sole  cause  of  correlation  between  them  except  in 
the  case  of  very  similar  activities. 

It  can  easily  be  shown  that  if  all  the  correlations  are  due  solely  to 
one  common  or  general  factor,  then  the  correlation  coefficients  will  be 
in  perfect  hierarchical  order*. 

Let  a  and  p  be  two  mental  tests  or  other  activities,  and  g  be  the 
general  factor.  Then  rav.g  is  the  correlation  that  a  would  have  with  p 

for  constant  g,  and  equals 

if     _  if  » 

'  '' 


But  if  g  is  the  sole  source  of  correlation,  rav.g  must  be  zero,  i.e. 

if    —  if     if 

'av        'ag'  'w 

Similarly  rbv  =  rbgr.pg. 

Hence  ^  =  ^   and  similarly  =  X 

rbg         rbp  fm 

*  Hart  and  Spearman,  Brit.  J&urn.  PsychoL  1912,  v.  p.  58  quoting  Yule.    Previous 
though  less  satisfactory  proofs  had  also  been  given  by  Spearman. 
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A  little  consideration  of  this  last  equation  shows  that,  if  it  be  true 
for  any  four  of  the  tests,  it  implies  the  possibility  of  arranging  the  corre- 
lation coefficients  in  the  order  we  have  termed  "hierarchical" :  and  more 
than  this,  that  the  values  of  r  in  any  one  column  of  the  "hierarchy" 
will  bear  a  constant  ratio,  each  to  each,  to  their  partners  in  any  other 
column. 

Since  clearly  perfect  hierarchical  order  cannot  be  expected  in  any 
experimental  research,  it  becomes  important  to  know  what  deviation 
from  perfection  can  be  allowed  without  giving  up  the  idea  of  a  general 
factor:  or  on  the  other  hand,  what  approach  to  perfection  can  be  attained 
without  the  presence  of  a  general  factor.  These  questions  will  occupy 
us  presently.  Meanwhile  we  turn  for  a  while  to  another  form  of  argument 
used  by  Professor  Spearman. 


(2)  USE  OF  THE  FORMULA  FOR  THE  CORRECTION  OF  OBSERVATIONAL 
ERRORS  TO  PROVE  THE  EXISTENCE  OF  A  GENERAL  FACTOR 

He  considered  that  by  the  use  of  his  formula  for  the  correction  of 
observational  errors  he  could  demonstrate  the  same  thing  (the  existence 
of  a  "  central  function  "),  and  could  in  particular  show  "  that  the  common 
and  essential  element  in  the  intelligences  wholly  coincides  with  the 
common  and  essential  element  in  the  sensory  functions*."  The  method 
of  proof  is  as  follows  : 

Let  xl9  x2  be  two  distinct  measures  of  sensory  discrimination,  and 
*/i>  2/2  ^wo  distinct  measures  of  intelligence. 

Then  the  correlation  of  the  function  common  to  the  functions 
measured  by  xl9  x2  with  the  function  common  to  the  functions  measured 

by  2/i>  2/2  is  e(iuai  to  _ 

V  fx      rx      rx      rx 


and  if  the  two  functions  referred  to  are  identical  this  expression  should 
be  equal  to  unity. 

In  the  present  article,  Spearman  uses  a  simplified  formula, 


and  puts  for  the  numerator  the  average  of  the  various  correlations 
*  Amer.  Journ.  Psychol  1904,  xv.  p.  269. 
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evaluated  between  the  intelligences  and  the  discriminations,  and  in  the 
denominator  puts 

rx1x2  =  the  average  correlation  of  the  intellective  gradings  with  one 
another, 

rvn/2  =  the  average  correlation  of  the  gradings  in  discrimination  with 

one  another, 

and  in  this  way  gets  results  approximately  equal  to  1  in  the  different 
groups  tested. 

One  or  two  remarks  may  appropriately  be  made  here.  In  the  first 
place,  the  full  formula  is  the  only  one  that  can  be  used  with  any  meaning 
or  justice  since  it  is  the  only  one  which  issues  logically  from  the  mathe- 
matical proof.  In  the  second  place,  the  applicability  of  the  true  formula 
must  be  considered  in  the  light  of  its  presuppositions  (mentioned  above, 
p.  158). 

Indeed,  the  assumption  that  8  and  e  are  uncorrelated  with  each  other 
or  with  x  or  y  seems  even  more  unwarrantable  here  than  in  the  case  of 
"correcting"  coefficients,  for  which  the  formula  was  originally  devised. 

(3)  RESEARCHES  BETWEEN  1904  AND  1912 

A  number  of  experimental  researches  on  these  lines,  in  some  of 
which  Professor  Spearman  himself  took  part,  were  carried  out  during 
the  eight  years  following  1904,  but  with  very  conflicting  results,  some 
experimenters  finding  the  hierarchical  order  among  the  coefficients, 
others  finding  no  such  order.  Two  articles  of  this  period,  for  example, 
are  those  of  Mr  Cyril  Burt*,  who  found  practically  perfect  hierarchical 
order,  and  Dr  William  Brownf,  who  found  small  trace  of  such  order. 
A  similar  conflict  of  opinion  was  found  with  regard  to  the  alternative 
method  of  attack,  as  for  example  in  the  research  by  Messrs  Thorndike, 
Lay  and  DeanJ.  The  subjects  examined  were  37  young  women  students 
and  25  high  school  boys.  The  tests  for  sensory  discrimination  were: 

(1)  drawing  lines  equal  to  given  lines,  and 

(2)  filling  boxes  with  shot  to  equal  in  weight  standard  weights; 
those  for  intelligence  were : 

(3)  judgment  of  fellow-students,  and 

(4)  judgment  of  teachers. 

*  Cyril  Burt,  "Experimental  Tests  of  General  Intelligence,"  Brit.  Journ.  Psychol. 
1909,  m.  pp.  94—177. 

f  William  Brown,  "Some  Experimental  Results  in  the  Correlation  of  Mental  Abilities," 
BriL  Journ.  Psychol.  1910,  m.  pp.  296—322. 

J  Thorndike,  Lay  and  Dean,  "The  Relation  of  Accuracy  in  Sensory  Discrimination 
to  General  Intelligence,"  Amer.  Jonrn.  Psychol.  July  1909,  xx.  pp.  364 — 369. 
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For  the  high  school  boys  (3)  and  (4)  were  combined  teachers'  and 
fellow-students'  judgments  and  school  marks,  respectively. 

In  the  first  case,  Spearman's  formula  gave  for  the  correlation  of  the 
factor  common  to  (1)  and  (2)  with  that  common  to  (3)  and  (4)  the  value 
0-26  instead  of  1-00.  In  the  second  case,  the  value  was  0-29.  Moreover, 
Thorndike  found  a  much  higher  correlation  between  discrimination  of 
lengths  and  discrimination  of  weights  than  between  either  one  of  them 
and  general  intelligence,  the  coefficients  being 

Accuracy  in  drawing  lines,  intelligence     0-15, 

Accuracy  in  making  up  weights,  intelligence       ...     0-25, 
Accuracy  in  drawing  lines  and  making  up  weights    0-50. 
Thus  the  results  were  in  decided  conflict  with  both  parts  of  Spearman's 
concluding  statement  "that  all  branches  of  intellectual  activity  have 
in  common  one  fundamental  function  (or  group  of  functions)  whereas 
the  remaining  or  specific  elements  of  the  activity  seem  in  every  case 
to  be  wholly  different  from  that  in  all  the  others*." 

Thorndike  sums  up  as  follows:  "In  general  there  is  evidence  of  a 
complex  set  of  bonds  between  the  psychological  equivalents  of  both 
what  we  call  the  formal  side  of  thought  and  what  we  call  its  content,  so 
that  one  is  almost  tempted  to  replace  Spearman's  statement  by  the 
equally  extravagant  one  that  there  is  nothing  whatever  common  to  all 
mental  functions,  or  to  any  part  of  themf." 

Things  were  in  this  very  unsatisfactory  state  when  an  important 
article  by  Professor  Spearman,  in  cooperation  with  Dr  Bernard  Hart, 
appeared  in  1912J.  In  this  article  the  difficulty  of  making  an  unbiassed 
judgment  as  to  the  presence  or  absence  of  hierarchical  order  was  recog- 
nised, and  a  form  of  calculation  was  given  for  obtaining  a  numerical 
criterion  of  the  degree  of  perfection  of  hierarchical  order,  which  criterion 
would  be  independent  of  any  bias  on  the  part  of  the  calculator  and  would, 
it  was  hoped,  give  the  true  amount  of  hierarchical  order,  corrected  for 
the  sampling  errors  of  experiment.  This  criterion  ranges  theoretically 
from  zero,  for  absence  of  hierarchical  order,  to  unity,  for  perfection 
of  hierarchical  order.  But  their  formula  can,  arithmetically,  exceed 
unity. 

*  C.  Spearman,  Amer.  Journ.  Psychol.  xv.  p.  284. 
t  Op.  cit.  p.  368. 

J  "General  Ability,  its  Existence  and  Nature,"  by  B.  Hart  and  C.  Spearman,  Brit. 
Journ.  Psychol.  1912,  v.  pp.  51—84. 
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(4)  A  CRITERION  FOR  HIERARCHICAL  ORDER 

The  underlying  idea  was  that  if  the  above  square  table  *  of  correlation 
coefficients  shows  hierarchical  order  in  any  degree,  there  will  be  corre- 
lation between  the  columns  of  that  table  taken  in  pairs,  and  that  when 
the  hierarchical  order  is  perfect  the  columnar  correlation  R  will  rise  to 
unity,  except  in  so  far  as  it  is  blurred  by  the  sampling  errors,  which 
obviously  cannot  increase  an  already  perfect  correlation,  but  can  only 
decrease  it.  Let  us  write  dashed  letters  throughout  for  the  true  values 
of  the  various  quantities,  which  in  ordinary  experiment  are  unknown, 
reserving  undashed  letters  for  their  measured  values.  We  then  have: 

r'  =  true  correlation  coefficient, 
e  =  its  sampling  error  on  one  occasion,  so  that 

^  =  r'  +  e, 

r'  =  mean  of  the  column  of  true  values  r1  ', 
r  =  mean  of  the  column  of  observed  values  r. 

In  finding  these  means,  that  coefficient  is  omitted  which  has  no  partner 
in  the  column  with  which  correlation  is  being  found.  Write  also 

p  =  /  measured  from  the  mean  of  the  true  column,  i.e. 

=  /  —  r',  and  similarly 
p  —  r  measured  from  the  mean  of  the  observed  column,  i.e. 


where  e  is  the  mean  of  the  column  of  e's. 

Then  for  two  columns  a  and  6,  the  true  columnar  correlation  which 
we  desire  to  know  is 

Of   /  f  t\ 

D    /  "  (Pm  Pa>  I  /i\ 

'*  S 


by  the  Bravais-Pearson  product-moment  formula,  S  indicating  summa- 
tion over  the  various  values  of  x,  i.e.  summation  up  the  column.  This 
can  be  written 

ft    '  =  _  S  (pxapa)  ~  S  (6^6^)  -  S  (/Paft'Cgg)  -  S  (p^  '  €&) 


In  this  expression,  the  three  quantities  of  the  form  S  (pp)  are  known. 
The  three  quantities  of  the  form  S  (ee)  are  not  known,  but  an  attempt 
can  be  made  to  estimate  their  probable  values  from  the  known  standard 
deviations  of  the  correlation  coefficients.  The  four  quantities  of  the 

*  p.  165. 
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form  S  (p'e)  are  treated  by  Dr  Hart  and  Professor  Spearman,  in  their 
paper,  as  negligible,  on  the  ground  that  p  will  not  in  general  be  correlated 
with  e.  This  assumption,  as  we  shall  presently  see,  was  erroneous. 

The  formula  at  which  Dr  Hart  and  Professor  Spearman  eventually 
arrive,  after  neglecting  these  quantities  and  making  various  other 
assumptions,  is 


=  _  -(n-l)  rabaxaaxb  _  ^_ 

V{S  (Pxa*)  -{n-l)a^}  V(S  M  -  (n  -  1)  ^fl 

where  the  o-'s  are  standard  deviations  of  the  correlation  coefficients,  the 
bar  indicates  mean  values  for  the  column,  and  n  is  the  number  of 
pairs  of  correlation  coefficients  concerned,  in  the  two  columns.  In  using 
their  formula,  its  authors  do  not  apply  it  to  all  the  pairs  of  columns 
in  the  square  table.  They  say:  "In  any  case  the  correction  must  be 
kept  within  limits  :  as  usual,  the  larger  the  correction  the  less  it  is  to  be 
trusted.  If  the  sampling  errors  are  large  enough,  they  eventually  will 
quite  swamp  the  true  differences  of  magnitude  upon  which  the  observed 
correlation  should  be  based.  In  this  case,  the  true  correlation  is  beyond 
ascertainment  ;  any  attempt  at  correction  is  merely  illusory.  To  avoid 
this,  and  at  the  same  time  to  ensure  impartial  treatment  of  all  data, 
it  is  necessary  to  fix  beforehand  some  definite  limit  to  the  feasibility  of 
correction.  We  have  here  adopted  the  following  standard:  in  order  to 
attempt  to  estimate  the  correct  correlation  between  columns,  it  is 
required  that  in  each  of  these  columns  the  mean  square  deviation  should  be 
at  least  double  the  correction  to  be  applied  to  that  deviation" 

That  is  to  say,  the  equation  (2)  is  not  to  be  used  unless,  in  each  factor 
of  the  denominator,  S  (p2)  is  at  least  double  its  correction  (n  —  1)  cr2. 
This  condition  (the  "correctional  standard")  will  be  found  to  be  im- 
portant. 

The  authors  applied  their  criterion  to  all  the  experimental  work 
available,  work  dating  from  various  periods,  and  representing  the  re- 
searches of  14  experimenters  on  1463  men,  women,  boys  and  girls. 
From  beginning  to  end  the  values  of  the  criterion  were  positive  and  very 
high.  The  mean  was  almost  complete  unity.  That  is  to  say,  Dr  Hart 
and  Professor  Spearman  claimed  that  all  the  data  then  available  showed 
perfect  hierarchical  order  among  the  correlation  coefficients,  even  the 
data  of  workers  like  Dr  Brown  and  Professor  Thorndike,  who  had  been 
unable  to  detect  any  such  order.  The  reasons  why  the  hierarchical  order 
among  the  correlation  coefficients  was  not  obvious  at  a  glance  were, 
according  to  these  authors,  two.  In  the  first  place,  their  theory  did  not 
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entirely  deny  the  presence  of  Group  Factors  of  narrow  range,  and  tests 
which  were  too  similar  were,  according  to  them,  to  be  pooled,  before  the 
hierarchical  order  would  become  apparent.  Only  in  very  few  cases  however 
did  they  find  it  necessary  to  pool  tests  in  the  data  used.  In  the  second 
place,  the  obscuring  of  the  perfect  hierarchical  order  was,  according  to 
them,  due  to  the  fact  that  only  a  small  sample  of  subjects  is  examined.  For 
this  error  allowance  is  made  in  the  formula  for  calculating  their  criterion. 

Dr  Hart  and  Professor  Spearman  therefore  considered  their  "  Theory 
of  Two  Factors "  proved.  This  theory  considers  ability  in  any  activity 
to  be  due  to  two  factors.  One  of  these  is  a  General  Factor,  common  to 
all  performances.  The  other  is  a  Specific  Factor,  unique  to  that  par- 
ticular performance,  or  at  any  rate  extending  only  over  a  very  narrow 
range  including  only  other  very  similar  performances.  "It  is  not 
asserted,"  they  say,  "that  the  General  Factor  prevails  exclusively  in 
the  case  of  performances  too  alike,  but  only  that  when  this  likeness  is 
diminished,  or  when 'the  resembling  performances  are  pooled  together, 
a  point  is  soon  reached  where  the  correlations  are  still  of  considerable 
magnitude,  but  now  indicate  no  common  factor  except  the  General  one." 

In  the  same  paper  Dr  Hart  and  Professor  Spearman  consider,  and  in 
their  opinion  confute,  two  other  theories,  (a)  the  older  view  of  Professor 
Thorndike,  viz.  a  general  independence  of  all  correlations,  and  (6)  Pro- 
fessor Thorndike's  newer  view  of  "levels,"  or  the  almost  universal  belief 
in  "types."  If  the  former  were  true,  their  criterion  would,  they  consider, 
show  an  average  value  of  about  zero :  if  the  latter,  a  low  minus  value. 

Their  argument  runs  as  follows. 

If  none  but  quite  Specific  Factors  are  present,  the  correlations  will 
all  be  zero,  and  the  pairs  of  columns  will  show  no  correlation  with  one 
another.  If  however  correlations  exist,  but  are  due  to  Group  Factors 
alone,  then  tests  which  share  a  Group  Factor  will  correlate  highly,  but 
others  will  not  correlate  at  all.  Let  there  be  three  such  Group  Factors; 
then  we  shall  obtain  not  a  hierarchy  but  an  arrangement  like  this: 
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in  which  the  high  correlations  are  concentrated  along  the  diagonal.  In 
this  arrangement  some  columns  will  correlate  positively,  namely  those 
in  which  the  high  correlations  come  opposite  one  another;  but  these 
will  be  in  the  minority  and  most  pairs  of  columns  will  correlate  negatively. 
Professor  Spearman  and  Dr  Hart  conclude  therefore  that  in  the  absence 
of  a  General  Factor  the  average  correlation  between  columns  will  be 
either  zero  or  negative,  and  that  only  a  General  Factor  will  give  a  very 
high  positive  correlation  between  pairs  of  columns. 

In  this  consideration  of  Group  Factors  however  it  has  been  tacitly 
assumed  that  there  is  no  overlapping  of  such  factors.  If  this  were  so 
then  indeed  a  hierarchy  would  be  impossible.  But  it  is  at  any  rate  a 
conceivable  hypothesis  that  such  overlapping  should  occur,  that  for 
example  there  might  exist  a  factor  common  to  three  tests  a,  b,  c  and 
another  common  to  c,  d,  e,  so  that  c  contains  both  factors:  and  on  this 
hypothesis  an  excellent  hierarchy  can  be  obtained  without  any  General 
Factor,  and  the  average  column  correlation  can  even  approach  unity, 
as  we  shall  show  presently. 

(5)  COMPLICATIONS  IN  THE  ORIGINAL  THEORY 

Many  experimental  researches  were  inspired  by  this  paper  of  Dr  Hart 
and  Professor  Spearman,  of  which,  as  a  good  example,  may  be  cited 
one  in  1913  by  Mr  Stanley  Wyatt*.  It  is  not  too  much  to  say  that 
in  practically  all  of  these  the  application  of  the  Hart  and  Spearman 
criterion  gave  values  closely  approximating  to  unity  and  therefore  sup- 
porting the  Theory  of  General  Ability.  But  complications  began  to 
arise,  of  which  the  first  of  importance  will  be  found  in  Dr  Edward  Webb's 
monograph  on  "Character  and  Intelligence,"  in  1915f.  Dr  Webb  con- 
sidered that  he  had  found  (in  addition  to  Professor  Spearman's  General 
Ability)  a  second  general  factor,  which  he  calls  "persistence  of  motives." 
Other  writers  began  to  find  that  their  data  required  for  their  explanation 
large  Group  Factors,  of  wider  range  than  those  contemplated  in  the 
original  form  of  Professor  Spearman's  theory}.  Quite  recently  Mr  J.  C. 
Maxwell  Garnett,  discussing  the  data  of  a  number  of  workers  with  the 
aid  of  mathematical  devices  which  he  has  introduced  for  the  purpose, 

*  Stanley  Wyatt,  "The  Quantitative  Investigation  of  Higher  Mental  Processes," 
Brit.  Jmtrn.  Psychol.  1913,  vi.  pp.  109 — 133. 

t  E.  Webb,  "Character  and  Intelligence,"  Brit.  Journ.  Paychol,  Monog.  Supplement, 
1915,  No.  3,  pp.  ix  and  99. 

t  See  especially  N.  Carey,  "Factors  in  the  Mental  Processes  of  School  Children," 
Brit.  Journ.  Paychol.  1916,  vm.  pp.  170 — 182. 
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concludes  that  in  addition  to  the  single  general  factor  of  Professor 
Spearman,  there  are  two  large  Group  Factors  which  are  practically 
general*  (one  of  them  being  indeed  almost  identical  with  Dr  Webb's 
second  general  factor),  which  he  calls  respectively  "Cleverness"  and 
"Purpose,"  both  distinct  from  General  Ability. 

It  is  clear  therefore  that  in  any  case  the  simple  original  form  of 
Professor  Spearman's  theory  is  becoming  complicated  by  additions 
which  tend  to  modify  it  very  considerably.  Meanwhile,  however,  one 
of  us  had  come  definitely  to  the  conclusion  that  the  mathematical 
foundations  upon  which  it  was  based  were  in  fact  incorrect.  Before 
developing  the  line  of  argument  which  led  to  this,  it  will  be  well  to  re- 
state Professor  Spearman's  case  in  its  simplest  terms  in  a  few  words. 

It  is  entirely  based  upon  the  observation  and  measurement  of  hierarchical 
order  among  correlation  coefficients.  It  states  that  after  allowance  has  been 
made  for  sampling  errors  this  hierarchical  order  is  found  practically  in 
perfection.  And  it  finally  states  that  such  a  high  degree  of  perfection  can 
only  be  produced  by  a  General  Factor,  and  the  absence  of  Group  Factors, 
which  would  mar  the  perfection. 

*  J.  C.  Maxwell  Garnett,  "General  Ability,  Cleverness,  and  Purpose,"  Brit.  Jwirn. 
Psychol.  1919,  ix.  pp.  345—366. 


CHAPTER  X* 
A  SAMPLING  THEORY  OF  ABILITY 

The  case  against  the  validity  of  Professor  Spearman's  argument — Hierarchical 
order  produced  by  random  overlap  of  group  factors,  without  any  general  factor — 
Application  of  the  "criterion"  to  these  cases,  apparently  proving  the  presence  of 
a  general  factor — The  erroneous  nature  of  the  "criterion" — Hierarchical  order  the 
natural  order  among  correlation  coefficients — A  sampling  theory  of  ability — Transfer 
of  training — Conclusions. 

(1)  THE  CASE  AGAINST  THE  VALIDITY  OF  PROFESSOR  SPEARMAN'S 

ARGUMENT 

As  we  have  already  seen  in  previous  chapters,  it  is  possible,  by  means 
of  dice  throws  or  in  other  ways,  to  make  artificial  experiments  on 
correlation,  with  the  immense  advantage  that  the  machinery  producing 
the  correlation  is  known,  and  that  therefore  conclusions  based  upon  the 
correlation  coefficients  can  be  confronted  with  the  facts.  Working  on 
these  lines,  one  of  usf  made,  in  1914,  a  set  of  imitation  "mental  tests" 
(really  dice  throws  of  a  complicated  kind),  which  were  known  to  contain 
no  General  Factor.  The  correlations  were  produced  by  a  number  of 
Group  Factors  which  were  of  wide  range,  and,  unlike  Professor  Spearman's 
Specific  or  Narrow  Group  Factors,  they  were  not  mutually  exclusive. 

These  imitation  mental  tests,  containing  no  General  Factor,  gave 
however  a  set  of  correlation  coefficients  in  excellent  hierarchical  order, 
and  the  criterion  was  when  calculated  found  to  be  unity,  so  that  had 
these  correlation  coefficients  been  published  as  the  result  of  experimental 
work,  they  would  have  been  claimed  by  Professor  Spearman  as  proving 
the  presence  of  a  General  Factor.  In  a  short  reply  Professor  Spearman 
laid  stress  on  the  fact  that  this  arrangement  of  Group  Factors  which 
thus  produced  practically  perfect  hierarchical  order  was  not  a  random 
arrangement,  that  it  was  exceedingly  improbable  that  this  one  special 
arrangement  should  have  occurred  in  each  of  the  psychological  researches 
of  many  experimenters,  so  improbable  indeed  as  to  be  ruled  entirely 
out  of  court  J,  and  that  a  random  arrangement  of  Group  Factors,  though 

*  Much  of  this  chapter,  and  part  of  the  preceding,  consists  of  extracts  from  "General 
versus  Group  Factors  in  Mental  Activities,"  by  G.  H.  Thomson,  Psychol.  Review,  1920, 
xxvn.  p.  173. 

f  Godfrey  H.  Thomson,  "A  Hierarchy  without  a  General  Factor,"  Brit.  Journ. 
Psychol.  1916,  vm.  pp.  271—281. 

J  C.  Spearman,  '.'Some  Comments  on  Mr  Thomson's  Paper,"  Brit.  Journ.  Psychol. 
1916,  vm.  p.  282. 
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it  might  give  some  hierarchical  order,  would  not  give  it  in  the  perfection 
actually  found.  The  obvious  way  to  find  out  if  this  is  so  or  not  is  to  try  it, 
with  artificial  "  mental  tests  "  formed  of  dice  throws.  This  was  done  in 
November  and  December  of  1918,  after  an  unavoidable  delay  of  some 
years.  Sets  of  artificial  variables  (analogous  to  the  scores  in  mental 
tests)  were  made,  in  each  of  which  the  arrangement  of  Group  Factors 
was  decided  by  the  chance  draws  of  cards  from  a  pack*.  It  was  found 
that  hierarchical  order  resulted,  which  when  measured  by  the  "criterion" 
appeared  to  be  perfect. 


(2)  HIERARCHICAL  ORDER  PRODUCED  BY  RANDOM  OVERLAP  OF 
GROUP  FACTORS,  WITHOUT  ANY  GENERAL  FACTOR! 

Write  down  the  letters  xlt  x2,  x3,  ...  as  the  names  of  the  variates  to 
be  formed,  and  prepare  columns  to  receive  the  numbers  of  group  factors 
and  specific  factors  in  each  variate.  Determine  each  number  by  the 
draw  of  a  card  from  an  ordinary  playing  pack,  returning  the  card  and 
shuffling  between  each  draw :  the  knave,  queen,  and  king  may  be  counted 
11,  12,  and  13  respectively.  The  result  of  one  such  set  of  drawings  is 
shown  in  this  table: 


Variate 

Group  factors 

Specific  factors 

Total 

xi 

5 

6 

10 

xz 

5 

3 

8 

xa 

12 

12 

24 

xt 

1 

3 

4 

*6 

7 

6 

13 

*e 

9 

5 

14 

X7 

13 

13 

26 

X8 

1 

2 

3 

X9 

9 

3 

12 

*tt 

11 

5 

16 

Proceed  next  to  identify  the  group  factors  of  each  variate.  Do  this 
by  using  a  single  suit  of  the  pack.  After  shuffling  it  well,  lay  out  the 
top  five  cards  to  represent  the  five  group  factors  in  xl9  and  note  them. 

*  Godfrey  H.  Thomson,  "On  the  Cause  of  Hierarchical  Order  among  the  Correlation 
Coefficients  of  a  Number  of  Variates  taken  in  Pairs,"  Proceedings  of  the  Royal  Society  of 
London,  1919,  xcv.  A,  pp.  400 — 408.  See  also,  by  the  same  author,  "The  Hierarchy  of 
Abilities,"  and  "The  Proof  or  Disproof  of  the  Existence  of  General  Ability,"  in  Brit. 
Journ.  Psychol  1919,  ix.  pp.  321—344. 

f  This  section  and  also  section  5  consists  largely  of  extracts  from  the  Proc.  Roy.  Soc. 
1919,  xcv.  A,  pp.  400 — 408. 
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After  replacing  them  and  reshuffling,  do  the  same  for  xz,  and  so  on,  as 
in  this  table: 


Ace      2        3        4        5        6        7        8        910KnQK 


The  next  step  is  to  note  the  number  of  factors  common  to  each  pair  of 
variates,  as  in  this  table: 


xl 

*, 

*3 

*4 

*5 

X6 

*7 

X8 

X9 

*10 

*1 



2 

5 

0 

5 

2 

5 

1 

5 

4 

x± 

2 

— 

5 

3 

5 

5 

0 

2 

4 

xt 

5 

5 

— 

7 

8 

12 

1 

8 

10 

X4 

0 

1 

1 

— 

1 

1 

1 

0 

0 

1 

X6 

5 

3 

7 

— 

4 

7 

1 

5 

6 

xt 

2 

5 

8 

4 

— 

9 

0 

5 

8 

X, 

5 

5 

12 

7 

9 

— 

1 

9 

11 

•i 

1 

0 

1 

0 

1 

0 

1 

— 

1 

1 

x% 

5 

2 

8 

0 

5 

5 

9 

1 

— 

8 

*10 

4 

4 

10 

1 

6 

8 

11 

1 

8 

— 

From  these,  and  from  the  total  number  of  factors  both  specific  and  group 
in  each  variate,  can  be  found  the  correlation  which  would  occur  between 
the  variates  were  we  to  throw  dice,  one  to  each  factor,  and  repeat  the 
throwings  a  large  number  of  times.  The  formula  is 


r  = 


Number  of  common  factors 
Geometrical  mean  of  totals* 


This  formula  is  applicable  not  only  to  variates  formed  by  the  addition 
of  dice,  but  to  variates  which  are  any  function  of  the  factors  or  elements, 
provided  that  the  form  of  the  function  is  the  same  in  each  variate,  and 
that  the  standard  deviation  is  the  same  for  each  element  or  factor*. 
We  thus  obtain  the  following  table  of  theoretical  correlation  coeffi- 
cients : 


*  G.  H.  Thomson,  loc.  dt.  p.  276.   It  can  readily  be  deduced  from  Bravais,  Memoires 
de  I'lnstitut  de  France,  1846,  ix.  eqn.  28. 
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X,     X2     *,     X.     X5     X6     X,     XB     X9     XU 

Totals 

xl 



0-22 

0-32 

0-00 

0-44 

0-17 

0-31 

0-18 

0-46 

0-32 

2-42 

0-22 



0-39 

0-18 

0-29 

0-47 

0-35 

0-00 

0-20 

0-35 

2-45 

X3 

0-32 

0-39 

— 

0-10 

0-39 

0-44 

0-48 

0-12 

0-62 

0-51 

3-37 

0-00 

0-18 

0-10 



0-14 

0-13 

0-10 

0-00 

0-00 

0-12 

0-77 

XB 

0-44 

0-29 

0-39 

0-14 

— 

0-30 

0-38 

0-16 

0-40 

0-41 

2-91 

X 

0-17 

0-47 

0-44 

0-13 

0-30 

— 

0-47 

0-00 

0-38 

0-53 

2-89 

X1 

0-31 

0-35 

0-48 

0-10 

0-38 

0-47 

— 

0-11 

0-51 

0-54 

3-25 

X 

0-18 

0-00 

0-12 

0-00 

0-16 

0-00 

0-11 

— 

0-17 

0-15 

0-89 

X9 

0-46 

0-20 

0-62 

0-00 

0-40 

0-38 

0-51 

0-17 

— 

0-58 

3-32 

X10 

0-32 

0-35 

0-51 

0-12 

0-41 

0-53 

0-54 

0-15 

0-58 

— 

3-51 

If  we  wished  to  obtain  experimental  values  of  these,  dice  would  have 
to  be  thrown,  one  to  each  factor.  The  die  corresponding  to  the  group 
factor  called  "Ace"  would  have  its  score  counted  into  every  variate 
containing  the  group  factor  in  question.  The  dice  representing  the 
specific  factors  would,  of  course,  only  be  counted  into  the  one  variate 
in  which  they  occur. 

The  last  column  of  the  preceding  table  gives  the  total  correlation  of 
each  variate  with  all  the  others,  found  by  adding  the  rows  of  the  square 
table.  Rearrange  now  the  sequence  of  the  variates  in  the  order  of 
magnitude  of  these  totals*,  and  we  obtain  the  following  table: 


|    X10 

X3 

X9 

X7 

x& 

^6 

X2 

XL 

*8 

x* 

X10 

0-51 

0-58 

0-54 

0-41 

0-53 

0-35 

0-32 

0-15 

0-12 

*3 

0-51 

— 

0-62 

0-48 

0-39 

0-44 

0-39 

0-32 

0-12 

0-10 

X9 

0-58 

0-62 

— 

0-51 

0-40 

0-38 

0-20 

0-46 

0-17 

0-00 

X, 

0-54 

0-48 

0-51 

— 

0-38 

0-47 

0-35 

0-31 

0-11 

0-10 

*5 

0-41 

0-39 

0-40 

0-38 

— 

0-30 

0-29 

0-44 

0-16 

0-14 

*« 

0-53 

0-44 

0-38 

0-47 

0-30 

— 

0-47 

0-17 

0-00 

0-13 

X2 

0-35 

0-39 

0-20 

0-35 

0-29 

0-47 

— 

0-22 

0-00 

0-18 

Xl 

0-32 

0-32 

0-46 

0-31 

0-44 

0-17 

0-22 

— 

0-18 

0-00 

*J 

0-15 

0-12 

0-17 

0-11 

0-16 

0-00 

0-00 

0-18 

— 

0-00 

*4 

0-12 

0-10 

0-00 

0-10 

0-14 

0-13 

0-18 

0-00 

0-00 

— 

Here  the  tendency  to  hierarchical  order  is  quite  noticeable.  This  par- 
ticular example  is  purposely  chosen  from  among  a  number  calculated, 
as  being  that  which  shows  the  least  hierarchical  tendency.  Even  here 
however  there  is  clearly  a  general  lowering  of  the  coefficients  as  we 
pass  either  along  a  row  or  down  a  column.  The  columnar  correlation 
is  high  for  the  first  few  columns.  For  the  columns  headed  xw  and  xs  it  is 
0-97,  and  as  far  down  as  the  columns  headed  x5  and  x6  it  is  still  0-65.  If 
these  theoretical  numbers  were  blurred  by  experimental  error,  they  might 
well  be  claimed  as  having  come  from  a  perfect  hierarchy  by  the  criteria 
in  vogue.  Other  hierarchies  chosen  at  random  from  those  formed  in 
the  above  manner,  show  still  more  perfect  hierarchical  order,  and  some- 

*  A  convenient  p*an,  but  of  no  theoretical  significance. 
B.  &  T.  12 
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times  the  hierarchical  order  is  almost  quite  perfect,  as  in  the  example 
given  in  the  British  Journal  of  Psychology,  1919,  ix.  p.  343. 

(3)  APPLICATION  OF  THE  "CRITERION"  TO  THESE  CASES, 
APPARENTLY  PROVING  THE  PRESENCE  OF  A  GENERAL  FACTOR 

The  values  of  the  correlation  coefficients  given  in  the  above  table 
are  of  course  the  real  values.  To  obtain  experimental  values  of  these, 
dice  were  thrown,  one  die  to  each  Group  or  Specific  Factor,  and  the 
whole  repeated  20  times,  analogous  to  experiments  on  20  subjects*. 

From  the  dice  scores  the  observed  correlations  between  the  variates 
can  be  calculated,  just  as  the  correlations  between  mental  tests  are 
calculated.  Using  the  product-moment  formula  we  obtain  the  set  of 
values  in  this  table,  arranged  in  hierarchical  order,  only  slightly  different 
from  the  true  hierarchical  order,  except  that  variate  x3  has  changed 
its  position  rather  violently. 

The  Observed  Hierarchy 


i     **         * 

XV) 

•72 

•47 

•64 

•53 

•50 

•34 

•45 

•21 

•09 

x,           -72 

— 

•48 

•43 

•75 

•48 

•32 

•67 

-•26 

•10 

X9 

•47 

•48 

— 

•51 

•46 

•45 

•50 

•46 

-•02 

•24 

•64 

•43 

•51 

— 

•58 

•60 

•20 

•15 

•29 

•08 

X* 

•53 

•75 

•46 

•58 

— 

•63 

•26 

•33 

•05 

-•11 

•50 

•48 

•45 

•60 

•63 

— 

•22 

•29 

-•16 

•18 

xl 

•34 

•32 

•50 

•20 

•26 

•22 

— 

•41 

•38 

•15 

X 

•45 

•67 

•46 

•15 

•33 

•29 

•41 

— 

-•20 

•08 

x\ 

•21 

-•26 

-•02 

•29 

•05 

-•16 

•38 

-•20 

— 

-•11 

x* 

•09 

•10 

•24 

•08 

-•11 

•18 

•15 

•08 

-•11 

— 

The  pairs  of  columns  which  pass  the  Hart  and  Spearman  correctional 
standard  give  the  following  values: 


Columns  passing 
standard 

Observed  columnar 
correlation  R 

True  columnar 
correlation 

The  Hart  and  Spearman 
corrected  columnar 
correlation  R' 

2  and  7 
6    „    7 
2    „    3 
2    „    6 
3    „    6 

0-73 
0-63 
0-70 
0-81 
0-66 

0-75 
0-89 
0-60 
0-88 
0-83 

0-76 
1-15 
1-01 
1-06 
1-04 

Means 

0-71 

0-79 

1-00 

True  mean  columnar  correlation  of  the  whole  table  and  not  merely  / 
of  the  pairs  of  columns  selected  by  the  correctional  standard        ( 

*  G.  H.  Thomson,  Biometrika,  1919,  xn.  pp.  355—366,  where  the  full  details  of  the 
dice  throws  are  given.  Sections  3  and  4  of  the  present  chapter  consist  largely  of  extracts 
from  Bi&metrika,  where  the  diagrams  28  and  29  appeared. 
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Dr  Hart  and  Professor  Spearman  would  therefore  claim  the  hierarchy 
as  being  a  sample  of  a  perfect  one.  The  true  mean  columnar  correlation 
for  the  whole  table  is  0-59,  the  Hart  and  Spearman  correctional  standard 
selects  pairs  of  columns  whose  true  mean  columnar  correlation  is  O79, 
and  the  mean  value  of  these  when  corrected  according  to  their  formula 
rises  to  unity.  This  example  goes  far  towards  shaking  confidence  in 
their  criterion. 


(4)  THE  ERRONEOUS  NATURE  OF  THE  HART  AND 
SPEARMAN  CRITERION 

The  inaccurate  and  exaggerated  estimates  of  hierarchical  order  which 
are  given  by  this  "criterion"  arise  chiefly  from  two  causes,  (1)  the 
erroneous  assumption  that  pr  and  e  are  uncorrelated  (see  p.  170),  and 
(2)  the  action  of  the  "correctional  standard."  We  shall  consider  these 
in  turn. 

Consider  the  formula  for  the  standard  deviation  of  a  correlation 
coefficient,  viz. 


where  N  is  the  number  in  the  sample.  It  follows  from  this  that  the 
larger  correlation  coefficients  will  probably  have  the  smaller  sampling 
errors  e,  disregarding  the  sign  of  e  for  the  moment. 

But  these  signs  of  the  quantities  e  are  not  likely  to  be  indiscriminately 
positive  and  negative.  On  the  contrary,  they  will  have  a  tendency  to 
be  either  all  positive  or  all  negative,  if,  as  is  the  case  in  most  of  the 
columns  of  coefficients  considered  by  Professor  Spearman,  the  corre- 
lations in  the  square  table  are  mainly  positive.  The  errors  in  the  corre- 
lation of  a  variate  x±  with  a  variate  a  are  themselves  correlated  with 
the  errors  in  the  correlation  of  the  variate  a  with  another  variate  ic2, 
according  to  the  formula* 


x\a, 


r         —  v       2  _  «      2  _  v      2\ 

'  x\a        '  x\x%          '  x^a          '  x\a  t 


2  (1  -  rxie?)  (1  -  r^f) 

That  is,  the  correlation  of  the  sampling  errors  of  rxia  with  the  sampling 
errors  of  r^  depends  chiefly  upon  rXlXt.  To  illustrate,  let  us  take  three 

*  Karl  Pearson  and  L.  N.  G.  Filon,  "On  the  Probable  Errors  of  Frequency  Constants," 
Phil  Trans,  of  the  Royal  Soc.  1898,  cxoi.  A,  eqn.  37. 
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correlations  from  an  experiment  in  psychology,  carried  out  by  Mr  Wyatt*. 
If  we  let 

x1  be  the  mental  test  "  Rearranged  Letters," 

x2     „        „        „         "Missing  Digits," 

a      „         „        „        "Analogies," 


the  values  there  found  were 


=  0-63, 
=  0-61. 


Then  by  the  above  formula  the  correlation  of  the  errors  of  these  two 
coefficients  depends  chiefly  upon  rXlX2,  whose  measured  value  is  0-63. 
Using  the  full  formula,  and  employing  the  measured  values  in  default 
of  the  true  ones,  the  correlation  between  rxia  and  rxza  turns  out  to  be  -47. 
It  is  therefore  (to  an  extent  indicated  by  this  value)  probable  that  they 
are  either  both  too  large  or  both  too  small.  The  same  argument  holds, 
in  varying  degrees,  for  the  other  correlations  all  over  Mr  Wyatt' s  table, 
which  are  all  positive.  They  all  have  a  tendency  to  be  either  all  too 
large  or  all  too  small :  in  other  words,  the  e's  tend  to  be  all  of  the  same 
sign.  The  relationship  between  the  correlation  coefficients  of  a  column, 
and  their  errors,  can  therefore  be  summed  up  in  the  following  table, 
in  which  the  symbol  |  e  \  denotes  the  magnitude  of  e  regardless  of  sign. 


large 


small 


small 


large 


S(p'e)  = 


or  p'e 


-    or    + 


The  first  column  shows  the  true  correlations  r'  arranged  in  order  of 
magnitude.  The  second  column  expresses  the  fact  that  the  sampling 
errors  on  any  occasion  will  probably  be  arranged  in  the  reverse  order  of 
magnitude,  disregarding  their  signs.  The  third  column  shows  the  corre- 
lation coefficients  measured  from  their  mean.  The  upper  p"s  are  then 
positive,  and  the  lower  negative,  and  also,  what  is  not  shown  in  the 


*  Stanley  Wyatt,   "The  Quantitative  Investigation  of  Higher  Mental  Processes," 
Brit.  J<mrn.  Psychol  1913,  vi.  p.  131. 
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table,  the  absolute  values  increase  upwards  and  downwards  from  the 
point  where  the  signs  change.  The  fourth  (double)  column  shows  the 
probable  arrangement  of  the  signs  of  the  quantities  e.  If  the  e's  are  all 
tending  to  be  positive,  then  the  left-hand 
member  of  the  double  column  gives  the 
arrangement,  while  if  the  e's  all  tend  to  be 
negative,  the  other  member  of  the  double 
column  does  so.  As  shown  in  the  last  (double) 
column,  therefore,  the  quantities  p'e  tend 
either  to  be  nearly  all  negative  or  nearly  all 
positive.  For  a  very  small  sample  the  signs 
of  /o'e  will  no  doubt  be  quite  irregularly 
arranged.  But  with  such  a  small  sample, 
even  if  p  and  e  were  really  uncorrelated,  it 
would  be  most  unlikely  for  S  (p'e)  to  be 
negligible.  As  the  sample  increases  the  signs 
tend  to  settle  down  to  the  above  arrange- 
ment, and  S  (p'e)  does  not  tend  to  disappear 
compared  with  S  (ee),  but  only  to  take  on 
one  or  other  of  alternative  values.  It  will 
only  be  zero  when  all  the  errors  are  zero, 
i.e.  when  no  corrections  are  needed  to  R'. 
The  distribution  of  S  (p'e)  about  zero  in  a 
number  of  samples  of  the  same  size  will  not, 
that  is,  show  a  maximum  at  zero,  but  a 
minimum,  as  is  shown  qualitatively  in  Fig.  28. 

If,  in  fact,  the  actual  value  of  S  (p'e)  is 
calculated  in  cases  where  the  true  correla- 
tions are  known,  it  is  frequently  found  to  be 
greater  than  the  quantities  S  (ee)  which  are 
left  in  the  expression. 

The  other  approximations  made  in  obtain- 
ing the  criterion  do  not  appear  to  be  so 
erroneous  as  this  one,  though  their  cumulative 
effect  may  explain  some  anomalies.  Leaving 
them  on  one  side  let  us  consider  the  "cor- 
rectional standard  "  required  by  Dr  Hart  and 

Professor  Spearman  before  they  admit  any  pair  of  columns.  It  is  this 
correctional  standard,  combined  with  the  peculiar  distribution  of  R', 
which  chiefly  is  responsible  for  the  exaggeration  of  perfection  produced 
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by  this  criterion,  and  for  the  regularity  with  which  an  average  value  of 
unity  is  arrived  at. 

Let  us  examine  first  the  actual  distribution  of  the  Hart  and  Spearman 
R'  in  a  psychological  hierarchy,  viz.  that  of  Wyatt  already  referred  to, 
and  calculate  R'  not  only  for  those  columns  which  pass  the  correctional 
standard,  but  also  for  other  pairs  of  columns.  What  we  find  is  that  its 
value  rises  as  we  descend  the  hierarchy,  rushing  asymptotically  to 
infinity,  remaining  for  a  time  imaginary,  and  then  returning.  Specimen 
values  from  Mr  Wyatt' s  hierarchy  are  given. 

Pairs  of  columns  Values  of  the  Hart  and  Spearman  Criterion 

Analogies  and  Wordbuilding  0-93  \ 

Completion  and  Wordbuilding       - 0-97 

Completion  and  Part- wholes  ...         ...  1-05  >  Passed  by  the  correctional  standard 

Wordbuilding  and  Part-wholes        0-99 

Part-wholes  and  Memory  (delayed)  ...  0-92  / 

Rearranged  letters  and  Missing  digits        ...  1«17 

Wordbuilding  and  E  R  Test  1-26 

Sentence  construction  and  Fables 1-33 

Rearranged  letters  and  E  R  Test Practically  infinity 

Nonsense  syllables  and  dissected  pictures  ...  Imaginary 

Crossline  test  and  Letter  Squares 0-35,  a  meaningless  value,  both  factors  in  the 

denominator  being  now  negative 

Expressed  in  diagrammatic  form  this  and  similar  calculations  lead 
to  the  conclusion  that  in  actual  practice  the  criterion  is  distributed  as 
in  Fig.  29,  where  the  curve  is  to  be  understood  as  a  "  best  fitting  "  curve 
among  the  values  of  R'  scattered,  with  a  very  considerable  dispersion, 
on  both  sides  of  it.  The  line,  in  fact,  ought  to  be  a  broad  smudge. 

Now  clearly,  with  a  distribution  of  this  sort,  it  is  very  important 
that  the  boundary  between  the  values  that  are  to  be  rejected  and  those 
that  are  to  be  accepted  should  be  chosen  with  the  greatest  care,  and  not 
arbitrarily  but  scientifically.  Either  sound  theoretical  reasons  should 
be  given  for  the  choice  of  the  correctional  standard,  or  the  choice  should 
be  based  empirically  on  experiments  in  material  where  the  truth  is 
known  a  priori,  as  in  the  above  dice  experiments.  For  obviously,  by 
moving  this  boundary,  we  can  make  the  final  average  take  on  almost 
any  value.  Another  point  is  that  the  criterion  rushes  to  infinity  at  such 
speed  that  its  probable  error  must  be  enormous.  Dr  Hart  and  Professor 
Spearman,  however,  give  no  reasons  for  their  choice  of  this  particular 
standard,  upon  which  depends  so  much  the  values  they  obtain.  The 
standard  which  they  thus  arbitrarily  adopt  begins  admitting  the  criteria 
at  just  such  a  distance  above  unity  as  to  balance  the  cases  which  give  a 
criterion  below  unity,  and  entirely  explains  the  remarkable  unanimity  with 
which  this  average  value  unity  is  obtained  by  them  in  their  calculations. 
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In  other  words,  the  remarkable  regularity  with  which  this  criterion 
gives  the  value  unity  is  not  a  property  of  the  investigated  correlation 
coefficients  at  all,  but  is  a  property  possessed  by  the  criterion  itself,  due 
to  errors  and  the  action  of  the  "correctional  standard." 


Descending  the  hierarchy— - 
Fig.  29 

In  the  writers'  opinion  the  work  outlined  in  this  chapter  finally 
proves  the  invalidity  of  Professor  Spearman's  mathematical  argument 
in  favour  of  the  Theory  of  Two  Factors.  If  this  be  so  that  theory  returns 
to  the  status  of  a  possible,  but  unproven,  theory. 

(5)  HIERARCHICAL  ORDER  THE  NATURAL  ORDER  AMONG 

CORRELATION  COEFFICIENTS 

The  fact  is  that  hierarchical  order,  which  Professor  Spearman  was 
the  first  to  notice  among  correlation  coefficients,  is  the  natural  relation- 
ship among  these  coefficients,  on  any  theory  whatever  of  the  cause  of 
the  correlations,  excepting  only  theories  specially  designed  to  prevent 
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its  occurrence.  It  is  the  absence  of  hierarchical  order  which  would  be 
a  remarkable  phenomenon  requiring  special  explanation;  its  presence 
requires  none  beyond  what  is  termed  chance. 

An  analogy  from  the  simple  repeated  measurements  of  a  linear 
magnitude  may  help  to  illustrate  this.  Indeed  it  is  rather  more  than  an 
analogy,  being  in  fact  the  same  phenomenon  in  its  simplest  terms  and 
dimensions.  It  is  well  known  that  many  measurements  of  the  same 
quantity,  made  with  all  scientific  precautions,  under  apparently  the 
same  conditions,  and  with  an  avoidance  of  all  known  sources  of  error, 
nevertheless  do  not  give  a  number  of  identical  values.  The  values  are 
all  different,  but  are  not  without  law  and  order  in  their  arrangement. 
They  are  grouped  about  a  centre  from  which  the  density  decreases  in 
both  directions,  and  it  is  found  that  this  grouping  is  for  most  practical 
purposes  closely  represented  by  the  Normal  or  Gaussian  Curve  of  Error 
(or  one  of  the  more  general  Pearsonian  Curves). 

Experimenters  are  not  surprised*  to  find  their  data  obeying  the 
Probability  Law,  nor  do  they  require  a  special  theory  to  explain  it. 
On  the  contrary,  it  is  the  departures  from  this  Law  which  if  wide  would 
require  special  investigation,  and  if  confirmed  would  require  a  special 
theory.  In  the  same  way  hierarchical  order  among  correlation  coeffi- 
cients should  not  cause  surprise,  though  any  marked  variation  from  this 
order  would  demand  investigation. 

Correlation  coefficients  are  themselves  correlated,  and  n  correlation 
coefficients  form  an  w-fold  or  n-dimensional  correlation-surface.  The 
particular  and  convenient  form  of  tabulation  of  correlation  coefficients 
adopted  by  Professor  Spearman  and  followed  by  most  other  psycho- 
logical workers  brings  to  light,  in  the  form  of  "  hierarchical  order,  "  one 
of  the  properties  of  this  correlation-surface  of  the  correlations. 

In  an  article  entitled  "On  the  Probable  Errors  of  Frequency  Con- 
stants and  on  the  Influence  of  Random  Selection  on  Variation  and 
Correlation,"  in  the  Phil.  Trans.  1898,  cxci.  A,  pp.  229—311,  Professor 
Pearson  and  Mr  Filon  give  the  following  formulae  : 

+  ^r^r^  -  r122  -  r^  -  r312) 


2(l-ru«)(l-ru*) 

(»24  -  *2S*84)  +  (*14  ~  '13*84)  (*23  ~  Vis)     1 


p      ___        (yis  ~  '14*48)  (y24  ~  '21*14)  +  (ru  ~  'M  '24)  (fa  -  ra4'4»)l 
r«r*4  9  n      v  2\  n      ~  2\  ' 

ro*t%  ^  (L  ~  riz  M1  ~  ^34  ) 

so  that,  as  they  say,  "errors  in  the  correlations  of  a  first  organ  with  a 
second  and  a  third  have  a  correlation  themselves  of  the  first  order," 
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and  "errors  in  the  correlation  of  two  organs  and  in  the  correlation  of 
a  second  two  have  only  correlation  of  the  second  order." 

Suppose  now  that  the  correlations  among  a  number  of  variates  taken 
in  pairs  are  really  all  the  same,  and  positive,  and  in  a  sample  let  the 
observed  value  of  r38  be  the  highest  observed  value : 
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HI=  highest  correlation.     H2= second  highest,  and  so  on.     h = tendency  to  be  high. 

Then,  because  of  the  above  theorems  of  Pearson  and  Filon,  the  rows 
and  columns  x3  and  xs  will  probably  contain  more  total  correlation  than 
do  the  others,  and  the  second  highest  correlation  will  probably  be  in 
one  of  these.  Let  it  be  r36.  Then,  after  the  rows  x3  and  x8  the  row  #6 
will  probably  contain  many  high  correlations  and  r86  will  probably  be 
the  third  highest  coefficient,  because  it  is  a  node  where  two  ridges  of 
high  correlation  cross.  If  it  is,  then  the  hierarchy  so  far  is  excellent,  as 
can  be  seen  on  rearranging  the  square  table  so  as  to  bring  x3,  x8,  and  x6 
to  the  head*. 

Where  now  is  the  fourth  highest  r  likely  to  be  found?  Somewhere 
among  those  marked  h  for  high,  and  slightly  more  probably  among  the 
r's  of  the  row  xz.  If  to  avoid  further  rearranging  we  take  it  to  be  r31, 
then  rgl  is  most  probably  the  fifth,  and  r61  the  sixth,  because  they  are 
nodes.  And  so  on.  In  fact  it  is  clear  that  when  the  r's  are  really  equal 

*  See  next  page 
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to  one  another,  then  sampling  the  population  will  give  a  set  of  observed 
r's  which  are  arranged  not  in  haphazard,  but  in  hierarchical  order. 


Hz  h4  h  h  h  h 
h  h  h  h 
h  h  h  h 


h  h  h 
h  h  h 
h  h  h 


In  the  experiment  described  above  with  cards  and  dice,  where 
hierarchical  tendency  was  found  to  be  produced  by  group  and  specific 
factors  without  any  general  factor,  there  was,  however,  no  question  of 
sampling  the  population.  The  hierarchical  order  already  appears  in  the 
theoretical  coefficients.  There  is  here,  however,  another  kind  of  sampling 
present,  viz.  sampling  of  the  elements  which  make  up  the  variates. 

Let  us  suppose,  instead  of  deciding  the  numbers  of  groups  and  specific 
elements  as  we  did  by  drawing  cards,  we  had  dipped  into  an  infinite  bag 
containing  black  balls  and  white  balls,  the  former  representing  group 
factors  and  the  latter  specific  factors.  Then  the  most  probable  event 
would  have  been  that  the  proportion  of  group  factors  and  specific 
factors  in  each  variate  would  have  been  the  same  as  the  proportion  in 
which  the  balls  occurred  in  the  bag.  If,  in  addition,  we  assume  the  samples 
drawn  to  be  all  the  same  size,  then  the  most  probable  result  of  the  whole 
experiment  would  have  been  obtaining  all  the  correlations  equal. 

That  they  do  not  come  out  equal  may  be  regarded  as  due  to  the 
sampling.  The  samples  vary  in  size,  the  proportion  of  group  factors 
varies  from  sample  to  sample,  and  the  distribution  of  the  individual 
group  factors  among  the  several  variates  departs  from  the  most  probable 
distribution.  From  this  point  of  view  the  departure  of  the  correlation 
coefficients  from  equality  is  due  to  errors  of  sampling,  and  the  applica- 
tion of  the  theorem  of  Pearson  and  Filon  would  lead  us  to  expect  what 
as  we  have  seen  does  actually  occur,  namely,  hierarchical  order. 
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The  experiment  with  the  bag  of  balls  would  as  a  matter  of  fact  not 
produce  as  great  a  departure  from  equality  of  correlation  coefficients 
as  is  found  in  practice  in  experimental  psychology,  or  as  was  found  in 
our  form  of  the  experiment  with  card  drawing.  This  in  the  latter  case 
is  because  the  cards  give  a  greater  variation  of  the  proportion  of  group 
to  specific  factors  than  would  be  found  with  the  bag  of  balls.  And  this 
is  also  the  case  in  mental  tests,  which  are  not  chance  samples  of  the 
mental  elements,  but  are  carefully  chosen  so  as  to  measure  different 
kinds  of  activity.  '. 

This  application  of  Pearson  and  Filon's  theorem  (which  contemplates 
only  sampling  errors  in  the  ordinary  sense)  to  the  changes  in  correlation 
produced  by  sampling  the  underlying  elements,  is  no  doubt  somewhat 
novel,  and  may  appear  to  be  a  difficulty. 

Further  consideration  leads  to  the  following  resolution  of  the 
difficulty. 

Suppose  that  n  variates  (in  our  work  the  scores  in  mental  tests) 
are  so  connected  by  factors  that  the  correlations  are  all  equal  and 
positive.  Then  let  a  small  sample  of  the  population  be  taken.  The 
observed  correlations  will  show  departures  from  equality,  and  will  be 
found  to  be  in  hierarchical  order.  This  hierarchical  order  is  due  to 
sampling  the  population. 

Now  consider  why  the  correlations  do  not  come  out  at  their  true 
values.  They  give  of  course  the  true  values  for  the  sample.  The  reason 
of  their  departing  from  the  true  values  of  the  whole  population  is  that 
(a)  some  of  the  factors  which  really  are  links  between  the  variates  (the 
mental  activities)  happen  to  have  remained  steadier  than  usual  during 
the  sample.  In  the  limit  a  factor  might  happen  to  retain  exactly  the 
same  value  through  the  various  individuals  of  the  sample.  That  is, 
some  of  the  linking  factors  do  not  in  reality  come  into  action,  or  not  in 
their  full  force;  (6)  on  the  other  hand,  some  factors  which  are  really 
different  and  unconnected  may  happen  by  chance  to  rise  and  fall 
together,  through  the  sample,  and  more  or  less  to  act  as  one.  That  is, 
fictitious  linking  factors  are  created,  which  would  disappear  with  a 
larger  sample. 

Clearly  therefore  a  hierarchy  of  correlation  coefficients,  caused  by 
sampling  the  population,  is  due  to  chance  having  caused  a  change  in 
the  apparent  factors  acting.  It  follows  that  if  we  make  a  real  change 
in  the  factors  acting,  we  shall  get  a  hierarchy,  and  this  is  what  we  do 
when  we  choose  the  mental  tests  to  be  employed  in  any  research.  Each 
mental  test  is  a  test  of  a  sample  of  abilities. 
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The  laws  governing  the  correlation  of  correlation  coefficients  which 
vary  because  of  sampling  the  population  can,  in  fact,  be  applied  without 
hesitation  to  the  relationships  between  "true"  correlations  in  the  whole 
of  any  population  simply  because  any  such  population  is  itself  a  sample. 
English  grammar  school  boys  of  12  are  themselves  a  sample  of  a  larger 
boyhood;  the  whole  human  race  indeed  is  a  sample  of  "what  might  have 
been,"  selected  by  the  struggle  for  survival. 

The  whole  question  clearly  has  philosophical  bearings  on  the  degree 
of  reality  of  causal  connections;  for  on  this  view  those  chance  links  in 
a  small  sample  which  were  a  few  paragraphs  ago  termed  "fictitious  links, 
which  would  disappear  with  a  larger  sample,"  do  not  differ  except  in 
degree  from  the  "real"  causal  links  which  we  only  term  real  because 
they  persist  throughout  the  largest  sample  with  which  we  are  acquainted. 

In  another  direction  there  are  connections  with  the  difference,  which 
is  one  of  degree  only,  between  what  is  called  "partial"  correlation  and 
"entire"  correlation*. 

The  conclusion  to  be  drawn  is  that  hierarchical  order  is  the  natural 
order  to  expect  among  correlation  coefficients,  on  a  theory  of  chance 
sampling  alone,  and  that  therefore,  by  the  principle  of  Occam's  razor,  its 
presence  cannot  be  made  the  criterion  of  the  existence  of  any  special  form 
of  causal  connection,  such  as  is  assumed  in  the  Theory  of  Two  Factors. 

(6)  THOMSON'S  SAMPLING  THEORY  OF  ABILITY  f 
In  place  therefore  of  the  two  factors  of  that  theory,  one  General 
and  the  other  Specific,  Thomson  prefers  to  think  of  a  number  of  factors 
at  play  in  the  carrying  out  of  any  activity  such  as  a  mental  test,  these 
factors  being  a  sample  of  all  those  which  the  individual  has  at  his  com- 
mand. 

The  first  reason  for  preferring  this  theory  is  that  of  Occam's  razor. 
It  makes  fewer  assumptions  than  does  the  more  special  form  of  theory. 
It  does  not  deny  General  Ability,  for  if  the  samples  are  large  there  will 
of  course  be  factors  common  to  all  activities.  On  the  other  hand  it  does 
not  assert  General  Ability,  for  the  samples  may  not  be  so  large  as  this, 
and  no  single  factor  may  occur  in  every  activity.  If  moreover  a  number 
of  factors  do  run  through  the  whole  gamut  of  activities,  forming  a 
General  Factor,  this  group  need  not  be  the  same  in  every  individual. 

*  See  Karl  Pearson,  "On  the  Influence  of  Natural  Selection  on  the  Variability  and 
Correlation  of  Organs,"  Phil  Trans.  Roy.  Soc.  London,  1902,  cc.  A,  pp.  1 — 66;  Godfrey 
H.  Thomson,  "The  Proof  or  Disproof  of  the  Existence  of  General  Ability,"  Brit.  Journ. 
Psychol  1919,  ix.  pp.  321—336. 

t  Psychol.  Review,  1920,  xxvn.  p.  183. 
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In  other  words  General  Ability,  if  possessed  by  any  individual,  need 
not  be  psychologically  of  the  same  nature  as  any  General  Ability 
possessed  by  another  individual.  Everyone  has  probably  known  men 
who  were  good  all  round,  but  Jones  may  be  a  good  all  round  man  for 
different  reasons  from  those  which  make  Smith  good  all  round. 

The  Sampling  Theory,  then,  neither  denies  nor  asserts  General 
Ability,  though  it  says  it  is  unproven.  Nor  does  it  deny  Specific  Factors. 
On  the  other  hand  it  does  deny  the  absence  of  Group  Factors.  It  is  this 
absence  of  Group  Factors  which  is  in  truth  the  crux  of  Professor 
Spearman's  theory,  which  is  not  so  much  a  theory  of  general  ability ,. 
or  a  theory  of  two  factors,  as  a  Theory  of  the  Absence  of  Group  Factors. 
And  inasmuch  as  its  own  disciples  have  begun  to  require  Group  Factors 
to  explain  their  data,  its  distinguishing  mark  would  appear  in  any  case 
to  be  disappearing. 

Such  Group  Factors  as  are  admitted  by  Professor  Spearman  are  of 
very  narrow  range,  and  are  mutually  exclusive,  that  is  they  do  not 
overlap.  Both  these  points  follow  from  the  sentence  used  in  the  1912 
article  with  Dr  Hart,  where  it  is  said  that,  in  the  case  of  performances 
too  alike,  "when  this  likeness  is  diminished,  or  when  the  resembling 
performances  are  pooled  together,  a  point  is  soon  reached  where  the 
correlations  are  still  of  considerable  magnitude,  but  now  indicate  no 
common  factor  except  the  General  one." 

Since  this  point  is  soon  reached,  the  Group  Factors  must  be  narrow 
in  range.  Since  pooling  a  few  performances  will  obliterate  any  Group 
Factors,  they  must  be  exclusive  of  one  another.  For  if  A,  B,  C  and  D 
are  four  tests,  in  which  A  and  B  have  a  Group  Factor  common  to  them, 
and  C  and  D  another,  then  of  course  by  pooling  A  with  B  and  also  C 
with  D  we  can  obtain  two  pools  AB  and  CD  which  have  no  link.  But 
if  A,  B  and  C  have  one  Group  Factor,  and  C  and  D  have  another,  then 
these  Group  Factors  cannot  be  separated  into  Specific  Factors.  In  fact, 
a  Specific  Factor  is  a  separated  Group  Factor,  and  Professor  Spearman's 
theory  asserts  that  Group  Factors,  if  any,  are  separable  and  mutually 
exclusive.  This  is  a  great  stumbling-block  in  the  way  of  the  acceptance 
of  the  Theory  of  Two  Factors,  unless  perhaps  "Specific  Factor"  is 
interpreted  in  the  way  suggested  later. 

It  is  a  fact  which  will  be  admitted  by  most  that  the  same  activity 
is  not  performed  in  the  same  way  by  different  individuals,  even  though 
they  are  equally  expert.  Not  only  are  Specific  Factors  therefore  required 
by  this  theory  for  every  separate  activity,  excluding  only  any  which 
are  very  closely  similar,  but  also  Specific  Factors  of  different  psycho- 
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logical  natures  are  required  for  each  individual.  Further,  the  same 
individual  does  not  always  perform  the  same  activity  in  the  same  way. 
A  man  using  an  ergograph  will,  as  he  tires,  begin  to  employ  muscles 
other  than  those  naturally  used  at  the  outset.  When  we  are  returning 
from  a  cycle  ride  muscles  are  used  in  a  different  manner  from  the  style 
adopted  at  the  start,  indeed  sometimes  deliberate  changes  are  made  to 
give  relief.  And  in  the  same  way  a  mental  task  is  performed  by  different 
methods  at  different  times.  Does  this  then  mean  a  different  Specific 
Factor  for  each  way  of  doing  a  task?  All  these  difficulties  appear  to 
argue  against  the  Theory  of  Two  Factors,  and  seem  to  be  considerably 
cleared  up  by  the  Sampling  Theory. 

Finally,  the  Sampling  Theory  appears  to  be  in  accordance  with  a 
line  of  thought  which  has  already  proved  fruitful  in  other  sciences. 
Any  individual  is,  on  the  Mendelian  theory,  a  sample  of  unit  qualities 
derived  from  his  parents,  and  of  these  a  further  sample  is  apparent 
and  explicit  in  the  individual,  the  balance  being  dormant  but  capable 
of  contributing  to  the  sample  which  is  to  form  his  child.  It  seems  a 
natural  step  further  to  look  upon  any  activity  carried  out  by  this  indi- 
vidual as  involving  a  further  sample  of  these  qualities. 

(7)  THE  DIFFICULTY  OF  "TRANSFER  OF  TRAINING" 

Although  Professor  Spearman's  Theory  of  Two  Factors  has  been 
chiefly  based  by  him  on  the  line  of  argument  which,  it  is  suggested, 
has  now  been  proved  invalid,  viz.  the  "hierarchy"  argument,  yet  there 
is  another  and  powerful  form  of  reasoning  which  can  be  brought  to 
its  support,  based  upon  the  fact  that,  according  to  some  experimenters, 
improvement  in  any  activity  due  to  training  does  not  transfer  in  any 
appreciable  amount  to  any  other  activity,  except  to  those  very  similar 
indeed  to  the  trained  activity.  And  even  those  workers  who  do  not 
agree  that  this  is  an  experimental  fact  are  usually  content  to  take  a 
defensive  attitude  and  say  that  transfer  is  not  disproved.  Few  if  any 
will  say  that  it  is  proved. 

This  certainly  seems  to  point  to  the  absence  of  Group  Factors,  and 
to  support  Professor  Spearman's  theory,  which  only  needs  to  add  to 
itself  the  assumption  that  the  Specific  Factors  are,  while  the  General 
Factor  is  not,  capable  of  being  improved  by  training,  to  fit  the  case 
admirably.  Of  course,  if  transfer  really  occurs,  the  argument  proves 
the  opposite.  And  although  psychological  experiment  points  on  the 
whole  to  the  absence  or  the  narrowness  of  transfer,  yet  popular  opinion 
among  business  men,  schoolmasters,  and  others,  is  in  favour  of  transfer 
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to  a  considerable  extent.  Assuming  no  transfer,  however,  how  can  the 
Sampling  Theory,  with  its  numerous  Group  Factors,  explain  this? 

It  is  necessary  to  assume  that  the  Group  Factors  are  all  unim- 
provable or  only  slightly  improvable  by  training,  though  they  may 
change  with  the  growth  and  development  of  the  individual.  The 
improvement  which  certainly  takes  place  when  we  practise  any  activity 
is  due,  it  may  then  be  assumed,  not  to  improvement  in  the  elemental 
abilities  which  form  the  sample,  but  to  a  weeding  out,  and  selection  of 
these.  The  sample  alters,  mainly  no  doubt  is  diminished,  though  addi- 
tions are  also  conceivable.  It  becomes  a  more  economical  sample,  and 
waste  of  effort  in  using  elements  which  are  unnecessary  is  avoided. 
Improvement  in  any  mental  activity  may  on  this  view  be  compared 
with  improvement  in  a  manual  dexterity,  in  which  it  is  notorious  that 
the  improvement  consists  largely  in  the  avoidance  of-  unnecessary 
movements. 

When  another  activity  is  then  attempted,  the  elemental  factors  are 
just  the  same  as  they  would  have  been  had  the  practice  in  the  first 
activity  not  taken  place.  The  new  activity  will  be  performed  by  a  new 
group  of  factors,  which  sample  will  as  in  the  first  case  be  in  the  beginning 
wasteful  and  will  include  many  unnecessary  elements.  Transfer  of 
improvement  gained  in  the  first  activity  will  therefore  not  take  place 
except  in  so  far  as  the  second  activity  is  recognised  as  a  mere  variant  of 
the  original  one,  in  which  case  the  weeding  out  process  which  has  taken 
place  in  the  first  case  may  be  done  at  the  very  first  attempt,  at  any  rate 
to  some  extent. 

To  use  another  analogy,  the  improvement  which  takes  place  when 
a  football  team  practises  playing  together  for  a  series  of  matches  is  due 
more  to  team  work  than  to  individual  improvement.  A  new  team,  even 
though' it  contain  a  large  proportion  of  players  from  the  first  team,  will  not 
have  this  unity  of  action.  There  will  be  little  transfer  of  improvement. 

According  to  the  view  here  developed,  it  is  the  weeding  out  of  the 
sample  of  elemental  abilities  which  is  specific.  The  team  work  is  specific, 
though  the  players  play  for  several  clubs.  This  would  appear  to  enable 
a  reconciliation  to  be  effected  between  the  almost  universal  belief  in 
"  types  "  of  ability  (to  which  Professor  Spearman  refers)  and  the  experi- 
mental facts  concerning  both  correlation  and  transfer.  If  there  be  a 
General  Factor  at  all,  it  might  be  the  power  to  shake  down  rapidly  into 
good  team  work,  in  a  word,  educability.  But  there  seems  no  objection 
to  assuming  that  this,  instead  of  being  a  General  Factor,  is  a  property 
of  each  elemental  factor,  varying  from  factor  to  factor. 
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To  sum  up  this  section:  if  transfer  of  training  really  does  not  occur 
to  any  great  extent,  then  it  has  to  be  admitted  that  the  Theory  of  Two 
Factors  readily  explains  this.  But  the  Sampling  Theory  can  also  do  so, 
in  a  manner  which  is  perhaps  not  so  easy  to  set  forth,  but  which  never- 
theless appears  to  be  more  illuminating  and  less  artificial  than  the 
alternative  theory. 

(8)   CONCLUSIONS 

Professor  Spearman's  Theory  of  Two  Factors,  which  assumes  that 
ability  in  any  performance  is  due  to  (a)  a  General  Factor  and  (6)  a  Specific 
Factor  (Group  Factors  being  absent,  or  at  any  rate  very  narrow  in 
range  and  mutually  exclusive),  is  based  chiefly  on  the  observed  fact  that 
correlation  coefficients  in  psychological  tests  tend  to  fall  into  "hierar- 
chical order."  It  has  been  shown,  however,  that  the  criterion  adopted 
for  evaluating  the  degree  of  perfection  of  hierarchical  order  present  is 
untrustworthy  and  has  led  to  over-estimation.  Such  hierarchical  order 
as  is  actually  present  is  in  fact  the  natural  thing  to  expect,  and  it  is  the 
absence  of  such  which  should  occasion  surprise.  The  proof  of  the  Theory 
of  Two  Factors  which  is  based  on  the  presence  of  hierarchical  order 
therefore  falls  to  the  ground.  The  theory  remains  a  possible  explanation 
of  the  facts  but  ceases  to  be  the  unique  explanation.  As  an  alternative 
theory  Thomson  has  advanced  a  Sampling  Theory  of  Ability,  in  which 
any  performance  is  considered  as  being  carried  out  by  a  sample  of 
Group  Factors.  This  theory  is  preferred  because  it  makes  fewer  and  less 
special  assumptions,  because  it  is  more  elastic  and  wider,  and  because 
it  is  in  closer  accord  with  theories  in  use  in  biology  and  in  the  study  of 
heredity  *. 

*  For  articles  on  the  General  Ability  controversy  which  have  appeared  while  this 
book  was  going  through  the  press  the  reader  is  referred  to  the  conclusion  of  the  Biblio- 
graphy, p.  212. 


APPENDIX  T 

TABLES 

1 .  Fechner's  Fundamental  Table. 

2.  Urban's  Tables  for  the  Constant  Process. 

3.  Table  of  Muller-Urban  Weights. 

4.  Reciprocals  of  pq,  where  p  +  q  =  1. 

5.  Rich's  Checking  Table  for  the  Constant  Process. 

1.    Fechner's  Fundamental  Table. 


p 

7 

P 

7 

P 

7 

0-50 

0-0000 

0-67 

0-3111 

0-84 

0-7031 

0-51 

0-0177 

0-68 

0-3307 

0-85 

0-7329 

0-52 

0-0355 

0-69 

0-3506 

0-86 

0-7639 

0-53 

0-0532 

0-70 

0-3708 

0-87 

0-7965 

0-54 

0-0710 

0-71 

0-3913 

0-88 

0-8308 

0-55 

0-0888 

0-72 

0-4121 

0-89 

0-8673 

0-56 

0-1067 

0-73 

0-4333 

0-90 

0-9062 

0-57 

0-1247 

0-74 

0-4549 

0-91 

0-9480 

0-58 

0-1427 

0-75 

0-4769 

0-92    0-9935 

0-59 

0-1609 

0-76 

0-4994 

0-93    1-0435 

0-60 

0-1792 

0-77 

0-5224 

0-94 

1-0993 

0-61 

0-1975 

0-78 

0-5460 

0-95 

1-1630 

0-62 

0-2160 

0-79 

0-5702 

0-96 

1-2380 

0-63 

0-2346 

0-80 

0-5951 

0-97 

1-3300 

0-64 

0-2535 

0-81 

0-6208 

0-98 

1-4520 

0-65 

0-2725 

0-82 

0-6473 

0-99 

1-6450 

0-66 

0-2916 

0-83 

0-6747 

1-00 

00 

If  p  is  <  0-5,  look  in  the  Table  not  for  p  but  for  1  —  p,  and  take 
y  negative.  Thus  the  y  for  p  =  0-25  is  -  04769. 

This  Table  is  less  generally  useful  than  Sheppard's  Tables  of  the 
Probability  Integral,  Tables  I  and  II  of  Pearson's  Tables  for  Statisticians 
and  Biometricians  (Cambridge  University  Press).  Table  I  there  differs 
from  this  only  by  a  factor  -y/2 :  but  gives  many  more  values  and  to  more 
decimal  places. 
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2.    Urban1  s  Tables  for  the  Constant  Process. 
From  Arckivf.  d.  ges.  Psychol.  1912,  xxiv.  240 — 241*. 


p 

W 

yW 

2  W 

2*W 

2yW 

3W 

&W 

3yW 

4  JF 

42  W 

47  W 

0-50 

1-0000 

0-0000 

2-0000 

4-0000 

0-0000 

3-0000 

9-0000 

0-0000 

4-0000 

16-0000 

0-0000 

0-51 

0-9998 

0-0177 

1-9996 

3-9991 

0-0354 

2-9993 

8-9980 

0-0531 

3-9991 

15-9965 

0-0708 

0-52 

0-9991 

0-0355 

1-9982 

3-9963 

0-0709 

2-9972 

8-9917 

0-1064 

3-9963 

15-9853 

0-1419 

0-53 

0-9980 

0-0531 

1-9959 

3-9918 

0-1062 

2-9938 

8-9816 

0-1593 

3-9918 

15-9672 

0-2124 

0-54 

0-9964 

0-0707 

1-9928 

3-9855 

0-1415 

2-9891 

8-9674 

0-2122 

3-9855 

15-9421 

0-2830 

0-55 

0-9943 

0-0883 

1-9886 

3-9772 

0-1766 

2-9829 

8-9487 

0-2649 

3-9772 

15-9088 

0-3532 

0-56 

0-9918 

0-1058 

1-9836 

3-9671 

0-2116 

2-9753 

8-9260 

0-3175 

3-9671 

15-8685 

0-4233 

0-57 

0-9888 

0-1233 

1-9776 

3-9551 

0-2466 

2-9663 

8-8990 

0-3699 

3-9551 

15-8205 

0-4932 

0-58 

0-9853 

0-1406 

1-9706 

3-9413 

0-2812 

2-9560 

8-8679 

0-4218 

3-9413 

15-7651 

0-5624 

0-59  1 

0-9814 

0-1579 

1-9627 

3-9254 

0-3158 

2-9441 

8-8322 

0-4737 

3-9254 

15-7018 

0-6316 

0-60 

0-9768 

0-1750 

1-9537 

3-9074 

0-3501 

2-9306 

8-7916 

0-5252 

3-9074 

15-6296 

0-7002 

0-61 

0-9720 

0-1920 

1-9440 

3-8881 

0-3839 

2-9161 

8-7482 

0-5759 

3-8881 

15-5523 

0-7679 

0-62 

0-9666 

0-2088 

1-9332 

3-8663 

0-4176 

2-8997 

8-6992 

0-6263 

3-8663 

15-4653 

0-8351 

0-63 

0-9607 

0-2254 

1-9214 

3-8429 

0-4508 

2-8822 

8-6465 

0-6762 

3-8429 

15-3715 

0-9015 

0-64 

0-9542 

0-2419 

1-9084 

3-8168 

0-4838 

2-8626 

8-5878 

0-7257 

3-8168 

15-2672 

0-9676 

0-65 

0-9473 

0-2581 

1-8945 

3-7890 

0-5163 

2-8418 

8-5253 

0-7744 

3-7890 

15-1562 

1-0325 

0-66 

0-9398 

0-2741 

1-8797 

3-7594 

0-5481 

2-8196 

8-4586 

0-8222 

3-7594 

15-0376 

1-0962 

0-67 

0-9317 

0-2899 

1-8634 

3-7268 

0-5797 

2-7951 

8-3853 

0-8696 

3-7268 

14-9072 

1-1594 

0-68 

0-9232 

0-3053 

1-8464 

3-6929 

0-6106 

2-7697 

8-3090 

0-9159 

3-6929 

14-7715 

1-2212 

0-69 

0-9140 

0-3205 

1-8280 

3-6561 

0-6409 

2-7421 

8-2262 

0-9614 

3-6561 

14-6243 

1-2818 

0-70 

0-9043 

0-3353 

1-8085 

3-6170 

0-6706 

2-7128 

8-1383 

1-0059 

3-6170 

14-4682 

1-3412 

0-71 

0-8939 

0-3498 

1-7878 

3-5755 

0-6996 

2-6816 

8-0449 

1-0493 

3-5755 

14-3021 

1-3991 

0-72 

0-8830 

0-3639 

17659 

3-5318 

0-7277 

2-6489 

7-9466 

1-0916 

3-5318 

•14-1274 

1-4555 

0-73 

0-8713 

0-3775 

1-7426 

3-4852 

0-7541 

2-6139 

7-8417 

1-1326 

3-4852 

13-9408 

1-5101 

0-74 

0-8590 

0-3908 

1-7180 

3-4360 

0-7815 

2-5770 

7-7310 

1-1723 

3-4360 

13-7440 

1-5630 

0-75 

0-8460 

0-4035 

1-6921 

3-3842 

0-8070 

2-5381 

7-6144 

1-2104 

3-3842 

13-5366 

1-6139 

0-76 

0-8323 

0-4157 

1-6646 

3-3293 

0-8313 

2-4970 

7-4909 

1-2470 

3-3293 

13-3171 

1-6626 

0-77 

0-8179 

0-4273 

1-6357 

3-2714 

0-8545 

2-4536 

7-3607 

1-2818 

3-2714 

13-0858 

1-7090 

0-78 

0-8025 

0-4382 

1-6051 

3-2102 

0-8764 

2-4076 

7-2229 

1-3146 

3-2102 

12-8406 

1-7527 

0-79 

0-7865 

0-4484 

1-5729 

3-1459 

0-8969 

2-3594 

7-0782 

1-3453 

3-1459 

12-5835 

1-7938 

0-80 

0-7695 

0-4579 

1-5390 

3-0780 

0-9159 

2-3085 

6-9255 

1-3738 

3-0780 

12-3120 

1-8317 

0-81 

0-7515 

0-4665 

1-5031 

3-0061 

0-9331 

2-2546 

6-7638 

1-3996 

3-0061 

12-0245 

1-8662 

0-82 

0-7327 

0-4743 

1-4653 

2-9307 

0-9485 

2-1980 

6-5940 

1-4228 

2-9307 

11-7227 

1-8970 

0-83 

0-7129 

0-4810 

1-4257 

2-8515 

0-9619 

2-1386 

6-4158 

1-4429 

2-8515 

11-4059 

1-9239 

0-84 

0-6921 

0-4866 

1-3842 

2-7683 

0-9732 

2-0762 

6-2287 

1-4598 

2-7683 

11-0733 

1-9464 

0-85 

0-6697 

0-4908 

1-3394 

2-6788 

0-9816 

2-0091 

6-0273 

1-4725 

2-6788 

10-7152 

1-9633 

0-86 

0-6463 

0-4937 

1-2927 

2-5853 

0-9875 

•9390 

5-8170 

1-4812 

2-5853 

10-3413 

1-9749 

0-87 

0-6215 

0-4950 

1-2430 

2-4860 

0-9900 

•8645 

5-5935 

1-4851 

2-4860 

9-9440 

1  -9801 

0-88 

0-5953 

0-4946 

1-1907 

2-3813 

0-9892 

•7860 

5-3580 

1-4838 

2-3813 

9-5253 

1-9784 

0-89 

0-5673 

0-4920 

1-1346 

2-2692 

0-9840 

•7019 

5-1056 

1-4760 

2-2692 

9-0766 

1-9680 

0-90 

0-5376 

0-4871 

1-0751 

2-1502 

0-9743 

•6126 

4-8380 

1-4614 

2-1502 

8-6008 

1-9485 

0-91 

0-5059 

0-4796 

1-0118 

2-0236 

0-9592 

•5177 

4-5531 

1-4388 

2-0236 

8-0944 

1-9184 

0-92 

0-4718 

0-4687 

0-9435 

1-8871 

0-9374 

•4153 

4-2459 

1-4061 

1-8871 

7-5483 

1-8748 

0-93 

0-4351 

0-4540 

0-8702 

1-7403 

0-9080 

1-3052 

3-9157 

1-3620 

1-7403 

6-9613 

1-8160 

0-94 

0-3954 

0-4346 

0-7907 

1-5814 

0-8692 

1-1861 

3-5582 

1-3039 

1-5814 

6-3258 

1-7385 

0-95 

0-3519 

0-4093 

0-7038 

1-4076 

0-8185 

1-0557 

3-1671 

1-2278 

1-4076 

5-6304 

1-6370 

0-96 

0-3036 

0-3759 

0-6073 

1-2146 

0-7518 

0-9109 

2-7328 

1-1277 

1-2146 

4-8582 

1-5036 

0-97 

0-2469 

0-3282 

0-4936 

0-9871 

0-6564 

i  0-7403 

2-2210 

09847 

0-9871 

3-9485 

1-3129 

0-98 

0-1881 

0-2732 

0-3762 

0-7525 

0-5463 

i  0-5644 

1-6931 

0-8195 

0-7525 

3-0099 

1-0926 

0-99 

0-1127 

0-1854 

0-2254 

0-4508 

0-3708 

0-3381 

1-0142 

0-5561 

0-4508 

1-8030 

0-7416 

*  With  three  corrections,  two  of  which  were  made  by  Urban  in  the  Praxis  der  Ron- 
stanzmethode,  and  the  third  by  Rich  in  Amer.  Journ.  Psychol  1918,  xxix.  121.  The  last 
was  also  rediscovered  by  the  Cambridge  Univ.  Press  proof  reader. 
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2.    Urban' s  Tables  for  the  Constant  Process  (contd.). 

From  Archivf.  d.  ges.  Psychol.  1912,  xxiv.  240 — 241. 


p 

5W 

5*W 

5yW 

QW 

62  W 

QyW 

TW 

72  W 

77  W 

0-50 

5-0000 

25-0000 

0-0000 

6-0000 

36-0000 

0-0000 

7-0000 

49-0000 

0-0000 

0-51 

4-9989 

24-9945 

0-0885 

5-9987 

35-9921 

0-1062 

6-9985 

48-9892 

0-1239 

0-52 

4-9954 

24-9770 

0-1773 

5-9945 

35-9669 

0-2128 

6-9936 

48-9549 

0-2483 

0-53 

4-9898 

24-9488 

0-2655 

5-9877 

35-9262 

0-3185 

6-9856 

48-8996 

0-3716 

0-54 

4-9819 

24-9095 

0-3537 

5-9783 

35-8697 

0-4251 

6-9747 

48-8266 

0-4960 

0-55 

4-9715 

24-8575 

0-4415 

5-9658 

35-7948 

0-5298 

6-9601 

48-7207 

0-6181 

0-56 

4-9589 

24-7945 

0-5291 

5-9507 

35-7041 

0-6349 

6-9425 

48-5972 

0-7408 

0-57 

4-9439 

24-7195 

0-6165 

5-9327 

35-5961 

0-7398 

6-9215 

48-4502 

0-8631 

0-58 

4-9266 

24-6330 

0-7030 

5-9119 

35-4715 

0-8436 

6-8972 

48-2807 

0-9842 

0-59 

4-9068 

24-5340 

0-7895 

!  5-8882 

35-3290 

0-9474 

6-8695 

48-0866 

•1053 

0-60 

4-8842 

24-4212 

0-8753 

I  5-8611 

35-1666 

1-0503 

6-8380 

47-8656 

•2254 

0-61 

4-8601 

24-3005 

0-9599 

1  5-8321 

34-9927 

•1518 

6-8041 

47-6290 

•3438 

0-62 

4-8329 

24-1645 

1-0439 

!  5-7995 

34-7969 

•2527 

6-7661 

47-3624 

•4615 

0-63 

4-8036 

24-0180 

•1269 

i  5-7643 

34-5859 

•3523 

6-7250 

47-0753 

•5777 

0-64 

4-7710 

23-8550 

•2094 

5-7252 

34-3512 

•4513 

6-6794 

46-7558 

•6932 

0-65 

4-7363 

23-6815 

•2906 

5-6836 

34-1014 

•5488 

6-6308 

46-4157 

•8069 

0-66 

4-6992 

23-4962 

•3703 

5-6391 

33-8346 

•6444 

6-5790 

46-0526 

1-9184 

0-67 

4-6585 

23-2925 

•4493 

5-5902 

33-5412 

•7391 

6-5219 

45-6533 

2-0290 

0-68 

4-6161 

23-0805 

•5265 

5-5393 

33-2359 

1-8319 

6-4625 

45-2378 

2-1372 

0-69 

4-5701 

22-8505 

•6023 

5-4841 

32-9047 

1-9227 

6-3981 

44-7870 

2-2432 

0-70 

4-5213 

22-6065 

•6765 

5-4256 

32-5534 

2-0118 

6-3298 

44-3087 

2-3471 

0-71 

4-4694 

22-3470 

•7489 

5-3633 

32-1797 

2-0987 

6-2572 

43-8001 

2-4484 

0-72 

4-4148 

22-0740 

1-8193 

5-2978 

31-7866 

2-1832 

6-1807 

43-2650 

2-5471 

0-73 

4-3565 

21-7825 

1-8877 

5-2278 

31-3668 

2-2652 

6-0991 

42-6937 

2-6427 

0-74 

4-2950 

21-4750 

1-9538 

5-1540 

30-9240 

2-3446 

6-0130 

42-0910 

2-7353 

0-75 

4-2302 

21-1510 

2-0174 

5-0762 

30-4574 

2-4209 

5-9223 

41-4560 

2-8243 

0-76 

4-1616 

20-8080 

2-0783 

4-9939 

29-9635 

2-4940 

5-8262 

40-7837 

2-9096 

0-77 

4-0893 

20-4465 

2-1363 

4-9072 

29-4430 

2-5635 

5-7250 

40-0751 

2-9908 

0-78 

4-0127 

20-0635 

2-1909 

4-8152 

28-8914 

2-6291 

5-6178 

39-3245 

3-0673 

0-79 

3-9324 

19-6618 

2-2422 

4-7188 

28-3129 

2-6907 

5-5053 

38-5370 

3-1391 

0-80 

3-8475 

19-2375 

2-2896 

4-6170 

27-7020 

2-7476 

5-3865 

37-7055 

3-2055 

0-81 

3-7576 

18-7882 

2-3327 

4-5092 

27-0551 

2-7993 

5-2607 

36-8250 

3-2658 

0-82 

3-6634 

18-3168 

2-3713 

4-3960 

26-3761 

2-8455 

5-1287 

35-9008 

3-3198 

0-83 

3-5644 

17-8218 

2-4049 

4-2772 

25-6631 

2-8858 

4-9901 

34-9306 

3-3668 

0-84 

3-4604 

17-3020 

2-4330 

4-1525 

24-9149 

2-9196 

4-8447 

33-9119 

3-4062 

0-85 

3-3485 

16-7425 

2-4541 

4-0182 

24-1092 

2-9449 

4-6879 

32-8153 

3-4358 

0-86 

3-2316 

16-1582 

2-4687 

3-8780 

23-2679 

2-9624 

4-5243 

31-6702 

3-4561 

0-87 

3-1075 

15-5375 

2-4751 

3-7290 

22-3740 

2-9701 

4-3505 

30-4535 

3-4652 

0-88 

2-9766 

14-8832 

2-4730 

3-5720 

21-4319 

2-9676 

4-1673 

29-1712 

3-4622 

0-89 

2-8364 

14-1822 

2-4600 

3-4037 

20-4222 

2-9521 

3-9710 

27-7972 

3-4441 

0-90 

2-6878 

13-4388 

2-4356 

3-2253 

19-3518 

2-9228 

3-7628 

26-3400 

3-4099 

0-91 

2-5295 

12-6475 

2-3980 

3-0354 

18-2124 

2-8776 

3-5413 

24-7891 

3-3572 

0-92 

2-3588 

11-7942 

2-3435 

2-8306 

16-9837 

2-8122 

3-3024 

23-1167 

3-2809 

0-93 

2-1754 

10-8770 

2-2700 

2-6105 

15-6629 

2-7240 

3-0456 

21-3189 

3-1780 

0-94 

1-9768 

9-8840 

2-1731 

2-3722 

14-2330 

2-6077 

2-7675 

19-3726 

3-0423 

0-95 

1-7595 

8-7975 

2-0463 

2-1114 

12-6684 

2-4556 

2-4633 

17-2431 

2-8648 

0-96 

1-5182 

7-5910 

1-8795 

1-8218 

10-9310 

2-2554 

2-1255 

14-8784 

2-6313 

0-97 

1-2339 

6-1695 

1-6411 

1-4807 

8-8841 

1-9693 

1-7275 

12-0922 

2-2975 

0-98 

0-9406 

4-7030 

1-3658 

1-1287 

6-7723 

1-6389 

1-3168 

9-2179 

1-9121 

0-99 

0-5634 

2-7172 

0-9269 

0-6761 

3-9568 

1-1123 

0-7888 

5-4218 

1-2976 

13—2 
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3.    Table  of  Muller-Urban  Weights*. 


p 

W 

P 

W 

J) 

W 

0-50 

1-000 

0-67 

0-932 

0-84 

0-694 

0-51 

1-000 

0-68 

0-923 

0-85 

0-670 

0-52 

0-999 

0-69          0-914 

0-86 

0-646 

0-53 

0-998 

0-70          0-904 

0-87 

0-621 

0-54 

0-996 

0-71           0-894 

0-88 

0-595 

0-55 

0-995 

0-72          0-883 

0-89 

0-567 

0-56 

0-992 

0-73 

0-871 

0-90 

0-538 

0-57 

0-989 

0-74          0-859 

0-91 

0-506 

0-58 

0-985 

0-75 

0-846 

0-92 

0-472 

059 

0-981 

0-76 

0-832 

0-93 

0-435 

0-60 

0-977 

0-77 

0-818 

0-94 

0-396 

0-61 

0-972 

0-78 

0-803 

0-95 

0-352 

0-62 

0-967 

0-79 

0-787 

0-96 

0-304 

0-63 

0-960 

0-80 

0-770 

0-97 

0-249 

0-64 

0-954 

0-81 

0-752 

0-98 

0-187 

0-65 

0-947 

0-82 

0-733 

0-99 

0-112 

0-66 

0-940 

0-83 

0-713 

1-00 

0-000 

The  weight  of  a  p  which  is  less  than  0-5  is  the  same  as  the  weight  of 
a  p  which  exceeds  0-5  by  the  same  amount.  Thus  the  weights  of  p  =  O25 
and  of  p  =  0-75  are  both  alike,  =  0-846. 

*  The  table  is  quoted  from  F.  M.  Urban,  "The  Method  of  Constant  Stimuli  and  its 
Generalisations,"  Psychological  Review,  1910,  xvn.  p.  253.  See  also  "Die  psychophysischen 
Massmethoden  als  Grundlagen  empirischer  Messungen,"  by  the  same  author,  Archiv  f.  d» 
get.  Psychologic,  xv.  and  xvi. 

4.   Reciprocals  of  pq,  where  p  4-  q  =  1. 


porq 

1 
pq 

porq 

1 

pq 

porq 

1 

m 

•50 

4-0 

•67 

4-5 

•84 

7-5 

•51 

4-0 

•68 

4-6 

•85 

7-9 

•52 

4-0 

•69 

4-7 

•86 

8-3 

•53 

4-0 

•70 

4-8 

•87 

8-8 

•54 

4-0 

•71 

4-9 

•88 

9-4 

•55 

4-1 

•72 

5-0 

•89 

10-2 

•56 

4-1 

•73 

5-1 

•90 

11-1 

•57 

4-1 

•74 

5-2 

•91 

12-2 

•58 

4-1 

•75 

5-3 

•92 

13-6 

•59 

4-1 

•76 

5-4 

•93 

15-4 

•60 

4-2 

•77 

5-6 

•94 

17-7 

•61 

4-2 

•78 

5-8 

•95 

21-0 

•62 

4-3 

•79 

6-0 

•96 

26-0 

•63 

4-3 

•80 

6-2 

•97 

34-4 

•64 

4-4 

•81 

6-5 

•98 

51-0 

•65 

4-4 

•82 

6-8 

•99 

101 

•66 

4-5 

•83 

7-1 

1-00 

00 
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5.   Rich's  Checking  Table  for  the  Constant  Method. 

Published  from  the  Laboratory  of  Cornell  University,  in 
Amer.  Journ.  Psychol.  1918,  xxix.  120. 

An  example  will  best  explain  the  use  of  this  Table.  Consider  the 
example  worked  on  p.  73.  If  we  form  for  each  line  the  totals  of  the 
five  quantities 

W  +  yW  +  sW  +  S*W  +  SyW 
we  get  the  results 


-5 

-4 

-3 

-2 

0 

2 

4 

6 


P 

•00 
•10 
•14 
•40 
•65 
•80 
•87 
•96 
1-00 


Totals 
•0000 
13-2371 
9-8835 
7-1880 
2-5836 
1-2274 
5-8355 
8-2559 
0-0000 


48-2110 

These  totals  however  will  all  be  found  in  Kich's  Table  entered  with 
the  proper  s  (Rich's  x)  and  p,  and  can  thus  be  checked.  The  grand  total 
48-2110  ought  to  agree  with  the  sum  of  the  last  row  of  the  table  (4)  on 
p.  73,  i.e.  with 

4-8026  +  0-4311  -  7-6406  +  43-7049  +  6-9130. 
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Rich's  Checking  Table. 


p 

x  =  l 

x  =  2 

z  =  3 

z=4 

x  =  o 

x  =  Q 

x=l 

•01 
•02 
•03 
•04 
•05 

-  -0327 
•0179 
•0843 
•1590 
•2371 

•2327 
•4973 
•7430 
•9978 
1-2355 

•7235 
1-3529 
1-8953 
2-4437 
2-9376 

1-4396 
2-5847 
3-5414 
4-4969 
5-3436 

2-2810 
4-1927 
5-6810 
7-1574 
8-4533 

3-4479 
6-1770 
8-3142 
10-4251 
12-2668 

4-8403 
8-5375 
11-4409 
14-3003 
16-7842 

•06 
•07 
•08 
•09 
•10 

•3170 
•3973 
•4780 
•5585 
•6386 

1-4637 
1-6836 
1-8963 
2-1025 
2-3015 

3-4012 
3-8400 
4-2582 
4-6583 
5-0397 

6-1295 
6-8667 
7-5637 
8-2259 
8-8530 

9-6485 
10-7635 
11-8126 
12-8053 
13-7415 

13-9583 
15-5305 
17-0052 
18-3965 
19-7048 

19-0586 
21-1676 
23-1413 
24-9995 
26-7434 

•11 
•12 
•13 
•14 
•15 

•7179 
•7967 
•8745 
•9515 
1-0275 

2-4951 
2-6835 
2-8655 
3-0431 
3-2155 

5-4068 
5-7609 
6-0994 
6-4274 
6-7428 

9-4531 
10-0289 
10-5764 
11-1043 
11-6096 

14-6339 
15-4875 
16-2964 
17-0737 
17-8158 

20-9491 
22-1370 
23-2594 
24-3361 
25-3614 

28-3994 
29-9770 
31-4653 
32-8910 
34-2463 

•16 
•17 

•18 
•19 
•20 

•1031 
•1767 
•2495 
•3215 
•3927 

3-3848 
3-5472 
3-7059 
3-8611 
4-0127 

7-0506 
7-3434 
7-6276 
7-9038 
8-1718 

12-1007 
12-5654 
13-0148 
13-4494 
13-8699 

18-5349 
19-2132 
19-8673 
20-4981 
21-1070 

26-3533 
27-2864 
28-1850 
29-0500 
29-8830 

35-5559 
36-7858 
37-9681 
39-1049 
40-1981 

•21 
•22 
•23 
•24 
•26 

•4627 
•5311 
•5991 
•6655 
•7310 

4-1600 
4-3032 
4-4432 
4-6792 
4-7118 

8-4304 
8-6802 
8-9231 
9-1575 
9-3846 

14-2737 
14-6624 
15-0388 
15-4004 
15-7494 

21-6901 
22-2496 
22-7901 
23-3079 
23-8063 

30-6791 
31-4418 
32-1773 
32-8800 
33-5552 

41-2413 
42-2393 
43-1999 
44-1169 
44-9965 

•26 
•27 
•28 
•29 
•30 

•7954 
•8589 
•9212 
1-9821 
2-0423 

4-8407 
4-9675 
6-0891 
5-2078 
5-3239 

9-6039 
9-8168 
10-0230 
10-2213 
10-4142 

16-0852 
16-4097 
16-7228 
17-0226 
17-3130 

24-2844 
24-7451 
25-1886 
25-6116 
26-0203 

34-2016 
34-8232 
35-4203 
35-9884 
36-5362 

45-8369 
46-6439 
47-4177 
48-1530 
48-8604 

•31 
•32 
•33 
•34 
•36 

2-1010 
2-1590 
2-2153 
2-2712 
2-3257 

5-4367 
5-6466 
5-6523 
5-7567 
5-8564 

10-6004 
10-7807 
10-9526 
11-1217 
11-2819 

17-5921 
17-8611 
18-1164 
18-3665 
18-6019 

26-4118 
26-7880 
27-1435 
27-4908 
27-8164 

37-0596 
37-5612 
38-0341 
38-4950 
38-9254 

49-5354 
50-1810 
50-7880 
51-3789 
51-9288 

•36 
•37 
•38 
•39 
•40 

2-3788 
2-4313 
2-4822 
2-5320 
2-5804 

5-9537 
6-0488 
6-1397 
6-2282 
6-3128 

11-4370 
11-5878 
11-7304 
11-8684 
11-9988 

18-8287 
19-0482 
19-2543 
19-4525 
19-6386 

28-1289 
28-4300 
28-7113 
28-9807 
29-2319 

39-3374 
39-7332 
40-1015 
40-4530 
40-7792 

52-4543 
52-9579 
53-4248 
53-8693 
54-2800 

•41 
•42 
•43 
•44 
45 

2-6284 
2-6747 
2-7198 
2-7638 
2-8063 

6-3958 
6-4754 
6-5516 
6-6251 
6-6952 

12-1261 
12-2468 
12-3609 
12-4698 
12-5727 

19-8191 
19-9887 
20-1479 
20-2983 

20-4388 

29-4748 
29-7013 
29-9124 
30-1103 
30-2935 

41-0933 
41-3845 
41-6545 
41-9059 
42-1368 

54-6743 
55-0384 
55-3741 
55-6849 
55-9687 

•46 
•47 

•48 
•49 
•50 

2-8478 
2-8878 
2-9263 
2-9640 
3-0000 

6-7625 
6-8264 
6-8872 
6-9454 
7-0000 

12-6700 
12-7610 
12-8461 
12-9263 
13-0000 

20-5703 
20-6915 
20-8033 
20-9069 
21-0000 

30-4634 
30-6180 
30-7587 
30-8870 
31-0000 

42-3486 
42-5403 
42-7122 
42-8667 
43-0000 

56-2310 
56-4585 
56-6638 
56-8459 
67-0000 
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p 

x  =  l 

x  =  2 

x  =  3 

z  =  4 

x  =  5 

x  =  Q 

x  =  7 

•51 
•52 
•53 
•54 
•55 

3-0348 
3-0683 
3-1002 
3-1306 
3-1595 

7-0516 
7-1000 
7-1450 
7-1869 
7-2250 

13-0679 
13-1299 
13-1858 
13-2358 
13-2791 

21-0839 
21-1581 
21-2225 
21-2777 
21-3218 

31-0994 
31-1843 
31-2552 
31-3122 
31-3531 

43-1145 

43-2088 
43-2835 
43-3402 
43-3730 

57-1291 
57-2314 
57-3079 
57-3644 
57-3815 

•56 
•57 

•58 
•59 
•60 

3-1870 
3-2130 
3-2371 
3-2600 
3-2804 

7-2599 
7-2914 
7-3190 
7-3432 
7-3630 

13-3164 
13-3473 
13-3716 
13-3893 
13-3992 

21-3565 
21  -3809 
21-3947 
21-3981 
21-3890 

31-3801 
31-3920 
31-3885 
31-3696 
31-3325 

43-3873 
43-3807 
43-3529 
43-3039 
43-2298 

57-3781 
57-3469 

57-2880 
57-2007 
57-0808 

•61 
•62 
•63 
•64 
•65 

3-3000 
3-3174 
3-3329 
3-3464 
3-3581 

7-3800 
7-3925 
7-4012 
7-4051 
7-4052 

13-4042 
13-4006 
13-3910 
13-3722 
13-3469 

21-3723 
21-3421 
21-3020 
21-2477 
21-1831 

31-2845 
31-2167 
31-1346 
31-0315 
30-9138 

43-1406 
43-0245 
42-8886 
42-7238 
42-5392 

56-9409 
56-7654 
56-5641 
56-3245 
56-0588 

•66 
•67 

•68 
•69 
•70 

3-3676 
3-3749 
3-3802 
3-3830 
3-3835 

7-4011 
7-3915 
7-3784 
7-3595 
7-3357 

13-3143 
13-2716 
13-2231 
13-1642 
13-0966 

21-1071 
21-0150 
20-9141 
20-7967 
20-6660 

30-7796 
30-6219 
30-4516 
30-2574 
30-0439 

42-3320 
42-0921 
41-8356 
41-5460 
41-2304 

55-7639 
55-4258 
55-0660 
54-6628 
54-2252 

•71 
•72 
•73 
•74 
•75 

3-3813 
3-3768 
3-3689 
3-3586 
3-3450 

7-3066 
7-2723 
7-2307 
7-1853 
7-1328 

13-0195 
12-9340 
12-8370 
12-7301 
12-6124 

20-5204 
20-3616 
20-1849 
19-9928 
19-7842 

29-8090 
29-5550 
29-2755 
28-9736 
28-6481 

40-8854 
40-5145 
40-1086 
39-6724 
39-2040 

53-7494 
53-2397 
52-6843 
52-0891 
51-4521 

•76 
•77 
•78 
•79 
•80 

3-3283 
3-3083 
3-2839 
3-2563 
3-2243 

7-0732 
7-0068 
6-9324 
6-8506 
6-7603 

12-4829 
12-3413 
12-1858 
12-0178 
11-8352 

19-5570 
19-3114 
19-0442 
18-7581 
18-4491 

28-2959 
27-9173 
27-5078 
27-0713 
26-6020 

38-6994 
38-1589 
37-5764 
36-9573 
36-2940 

50-7675 
50-0361 
49-2503 
48-4163 
47-5249 

•81 

•82 
•83 
•84 
•85 

3-1875 
3-1467 
3-1007 
3-0495 
2-9907 

6-6603 
6-5515 
6-4330 
6-3044 
6-1603 

11-6360 
11-4218 
11-1912 
10-9434 
10-6694 

18-1148 
17-7574 
17-3752 
16-9667 
16-5178 

26-0965 
25-5585 
24-9850 
24-3741 
23-7056 

35-5816 
34-8246 
34-0200 
33-1657 
32-2328 

46-5695 
45-5563 
44-4814 
43-3415 
42-0995 

•86 
•87 
•88 
•89 
•90 

2-9263 
2-8545 
2-7751 
2-6859 
2-5870 

6-0055 
5-8355 
5-6511 
5-4471 
5-2243 

10-3772 
10-0596 
9-7177 
9-3428 
8-9367 

16-0415 
15-5266 
14-9749 
14-3731 
13-7242 

22-9985 
22-2366 
21-4227 
20-5379 
19-5869 

31-2483 
30-1896 
29-0614 
27-8373 
26-5246 

40-7906 
39-3857 
37-8906 
36-2716 
34-5374 

•91 
•92 
•93 
•94 
•95 

2-4769 
2-3528 
2-2133 
2-0554 
1-8743 

4-9801 
4-7085 
4-4076 
4-0713 
3-6911 

8-4951 
8-0078 
7-4720 
6-8782 
6-2118 

13-0219 
12-2507 
11-4067 
10-4757 
9-4362 

18-5605 
17-4370 
16-2115 
14-8639 
13-3645 

25-1109 
23-5670 
21-8865 
20-0429 
17-9966 

32-6731 
30-6405 
28-4316 
26-0124 
23-3324 

•96 
•97 
•98 
•99 

1-6626 
1-3971 
1-1107 

•7089 

3-2532 
2-7122 
2-1363 
1-3451 

5-4509 
4-5211 
3-5383 
2-2065 

8-2559 
6-8236 
5-3163 
3-2934 

11-6682 
9-6196 

7-4707 
4-5056 

15-6877 
12-9092 
10-0012 
6-0433 

20-3147 
16-6923 
12-9081 
7-8063 
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p 

x=-l 

*=-6 

x=  -5 

x=  -4 

x=  -3 

x=  -2 

x=  -1 

x  =  0 

•01 
•02 
•03 
•04 
•05 

5-8579 
9-7281 
12-5809 
15-3119 
17-5872 

4-3203 
7-1974 
9-2914 
11-2923 
12-9552 

3-0080 
5-0431 
6-4954 
7-8800 
9-0269 

2-0210 
3-2649 
4-1930 
5-0749 
5-8024 

1-1595 
1-8631 
2-3841 
2-8773 
3-2818 

•5235 
•8375 
1-0686 
1-2868 
1-4649 

•1127 
•1881 
•2469 
•3036 
•3519 

-  -0727 
-  -0851 
-  -0813 
-  -0723 
-  -0574 

•06 
•07 
•08 
•09 
•10 

19-6082 
21-4324 
23-0983 
24-6313 
26-0376 

14-4293 
15-7575 
16-9684 
18-0809 
19-0998 

10-0411 
10-9527 
11-7820 
12-5423 
13-2371 

6-4437 
7-0181 
7-5391 
8-0155 
8-4496 

3-6368 
3-9536 
4-2398 
4-5005 
4-7373 

1-6207 
1-7592 
1-8841 
1-9973 
2-0999 

•3954 
•4351 
•4718 
•5059 
•5376 

-  -0392 
-  -0189 
•0031 
•0263 
•0505 

•11 
•12 
•13 
•14 
•15 

27-3456 
28-5668 
29-6947 
30-7546 
31-7421 

20-0459 
20-9282 
21-7416 
22-5049 
23-2148 

13-8811 
14-4803 
15-0316 
15-5479 
16-0270 

8-8507 
9-2231 
9-5646 
9-8835 
10-1786 

4-9550 
5-1565 
5-3406 
5-5118 
5-6696 

2-1939 

2-2805 
2-3595 
2-4327 
2-4999 

•5673 
•5953 
•6215 
•6463 
•6697 

•0753 
•1007 
•1265 
•1526 
•1789 

•16 
•17 

•18 
•19 
•20 

32-6789 
33-5392 
34-3503 
35-1151 
35-8361 

23-8875 
24-5036 
25-0840 
25-6302 
26-1442 

16-4801 
16-8942 
17-2831 
17-6483 
17-9912 

10-4569 
10-7102 
10-9474 
11-1696 
11-3773 

5-8178 
5-9520 
6-0772 
6-1938 
6-3024 

2-5628 
2-6196 
2-6723 
2-7211 
2-7665 

•6921 
•7129 
•7327 
•7515 
•7695 

•2055 
•2319 
•2584 
•2850 
•3116 

•21 
•22 
•23 
•24 
•25 

36-5089 
37-1383 
37-7315 
38-2837 
38-8005 

26-6229 
27-0696 
27-4899 
27-8802 
28-2446 

18-3097 
18-6060 
18-8841 
19-1413 
19-3807 

11-5695 
11-7474 
11-9140 
12-0670 
12-2088 

6-4022 
6-4942 
6-5795 
6-6575 
6-7292 

2-8080 
2-8458 
2-8808 
2-9126 
2-9416 

•7865 
•8025 
•8179 
•8323 
•8460 

•3381 
•3643 
•3906 
•4166 
•4425 

•26 
•27 
•28 
•29 
•30 

39-2815 
39-7311 
40-1505 
40-5354 
40-8950 

28-5828 
28-8980 
29-1911 
29-4592 
29-7086 

19-6020 
19-8075 
19-9976 
20-1706 
20-3307 

12-3392 
12-4595 
12-5702 
12-6698 
12-7614 

6-7945 
6-8542 
6-9084 
6-9567 
7-0004 

2-9677 
2-9905 
3-0127 
3-0314 
3-0481 

•8590 
•8713 
•8830 
•8939 
•9043 

•4682 
•4938 
•5191 
•5441 
•5690 

•31 
•32 
•33 
•34 
•35 

41-2256 
41-5304 
41-8022 
42-0577 
42-2810 

29-9368 
30-1464 
30-3319 
30-5056 
30-6558 

20-4762 
20-6088 
20-7251 
20-8330 
20-9250 

12-8435 
12-9177 
12-9816 
13-0401 
13-0889 

7-0390 
7-0731 
7-1016 
7-1269 
7-1471 

3-0625 
3-0750 
3-0849 
3-0935 
3-1000 

•9140 
•9232 
•9317 
•9398 
•9473 

•5935 
•6179 
•6418 
•6657 
•6892 

•36 
•37 
•38 
•39 
•40 

42-4819 
42-6633 
42-8156 
42-9487 
43-0548 

30-7896 
30-9092 
31-0079 
31-0924 
31-1576 

21-0057 
21-0766 
21-1333 
21-1803 
21-2141 

13-1303 
13-1654 
13-1919 
13-2121 
13-2242 

7-1632 
7-1758 
7-1836 
7-1880 
7-1880 

3-1045 
3-1076 
3-1085 
3-1080 
3-1056 

•9542 
•9607 
•9666 
•9720 
•9768 

•7123 
•7353 

•7578 
•7800 
•8018 

•41 
•42 
•43 
•44 
•45 

43-1459 
43-2124 
43-2573 
43-2815 
43-2847 

31-2117 
31-2479 
31-2687 
31-2743 
31-2648 

21-2402 
21-2541 
21-2576 
21-2507 
21-2335 

13-2315 
13-2309 
13-2241 
13-2107 
13-1908 

7-1853 
7-1784 
7-1681 
7-1542 
7-1367 

3-1020 
3-0966 
3-0896 
3-0811 
3-0712 

•9814 
•9853 
•9888 
•9918 
•9943 

•8235 

•8447 
•8655 
•8860 
•9060 

•46 
•47 
•48 
•49 
•50 

43-2736 
43-2305 
43-1732 
43-0967 
43-0000 

31-2422 
31-2019 
31-1488 
31-0817 
31-0000 

21-2070 
21-1694 
21-1225 
21-0662 
21-0000 

13-1653 
13-1327 
13-0945 
13-0503 
13-0000 

7-1162 
7-0920 
7-0645 
7-0339 
7-0000 

3-0599 
3-0470 
3-0326 
3-0170 
3-0000 

•9964 
•9980 
•9991 
•9998 
1-0000 

•9257 
•9449 
•9636 
•9821 
1-0000 
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Rich's  Checking  Table. 


p 

x  =  -7 

x=  -6 

x=  -5 

x=  -4 

x=-3 

x=-2 

x=-l 

x  =  0 

•51 
•52 
•53 
•54 
•55 

42-8843 
42-7476 
42-5935 
42-4230 
42-2251 

30-9047 
30-7942 
30-6711 
30-5334 
30-3818 

20-9246 
20-8389 
20-7446 
20-6410 
20-5271 

12-9441 
12-8817 
12-8141 
12-7407 
12-6610 

6-9631 
6-9227 
6-8796 
6-8332 
6-7835 

2-9816 
2-9618 
2-9408 
2-9183 
2-8946 

•9998 
•9991 
•9980 
•9964 
•9943 

1-0175 
1-0346 
1-0511 
1-0671 
1-0826 

•56 
•57 
•58 
•59 
•60 

42-0115 
41-7777 
41-5252 
41-2511 
40-9540 

30-2161 
30-0357 
29-8419 
29-6327 
29-4070 

20-4041 
20-2712 
20-1293 
19-9770 
19-8135 

12-5757 
12-4843 
12-3873 
12-2841 
12-1738 

6-7308 
6-6749 
6-6160 
6-5537 
6-4876 

2-8695 
2-8430 
2-8154 
2-7862 
2-7554 

•9918 
•9888 
•9853 
•9814 
•9768 

1-0976 
1-1121 
1-1259 
1-1393 
1-1518 

•61 
•62 
•63 
•64 
•65 

40-6451 
40-3102 
39-9587 
39-5793 
39-1834 

29-1728 
28-9201 
28-6554 
28-3708 
28-0744 

19-6445 
19-4631 
19-2736 
19-0707 
18-8600 

12-0603 
11-9393 
11-8132 
11-6789 
11-5401 

6-4202 
6-3486 
6-2742 
6-1956 
6-1145 

2-7242 
2-6909 
2-6568 
2-6207 
2-5836 

•9720 
•9666 
•9607 
•9542 
•9473 

1-1640 
1-1754 
1-1861 
1-1961 
1-2054 

•66 
•67 
•68 
•69 
•70 

38-7691 
38-3240 
37-8666 
37-3802 
36-8714 

27-7650 
27-4335 
27-0932 
26-7324 
26-3556 

18-6406 
18-4063 
18-1664 
17-9126 
17-6483 

11-3959 
11-2426 
11-0859 
10-9209 
10-7496 

6-0307 
5-9422 
5-8519 
5-7572 
5-6592 

2-5455 
2-5053 
2-4644 
2-4217 
2-3775 

•9398 
•9317 
•9232 
•9140 
•9043 

1-2139 
1-2216 
1-2285 
1-2345 
1-2396 

•71 
•72 
•73 

•74 
•75 

36-3382 
35-7841 
35-2007 
34-5925 
33-9589 

25-9614 
25-5525 
25-1226 
24-6752 
24-2098 

17-3724 
17-0868 
16-7871 
16-4760 
16-1529 

10-5712 
10-3870 
10-1943 
9-9948 
9-7880 

5-5577 
5-4530 
5-3440 
5-2315 
5-1154 

2-3318 
2-2851 
2-2373 
2-1863 
2-1346 

•8939 

•8830 
•8713 
•8590 
•8460 

1-2437 
1-2469 
1-2488 
1-2498 
1-2495 

•76 

•77 
•78 
•79 
•80 

33-2959 
32-6045 
31-8801 
31-1275 
30-3409 

23-7236 
23-2175 
22-6878 
22-1383 
21-5648 

15-8161 
15-4661 
15-1006 
14-7221 
14-3278 

9-5732 
9-3506 
9-1184 

8-8787 
8-6297 

4-9949 
4-8705 
4-7414 
4-6084 
4-4706 

2-0814 
2-0264 
•9694 
•9110 

•8505 

•8323 
•8179 
•8026 
•7865 
•7695 

1-2480 
1-2462 
1-2407 
1-2349 
1-2274 

•81 

•82 
•83 
•84 
•85 

29-5165 
28-6593 
27-7676 
26-8397 
25-8521 

20-9646 
20-3416 
19-6940 
19-0215 
18-3066 

13-9159 
13-4891 
13-0464 
12-5873 
12-1004 

8-3702 
8-1020 
7-8244 
7-5373 
7-2336 

4-3276 
4-1802 
4-0282 
3-8714 
3-7062 

•7879 
•7239 
•6578 
•5896 
•5183 

•7515 
•7327 
•7129 
•6921 
•6697 

1-2180 
1-2070 
1-1939 
1-1787 
1-1605 

•86 
•87 
•88 
•89 
•90 

24-8298 
23-7543 
22-6316 
21-4414 
20-1920 

17-5675 
16-7914 
15-9822 
15-1257 
14-2284 

11-5979 
11-0714 
10-5235 
9-9451 
9-3401 

6-9211 
6-5944 
6-2555 
5-8987 
5-5268 

3-5368 
3-3604 
3-1781 
2-9870 

2-7887 

•4451 
•3695 
1-2913 
1-2099 
1-1255 

•6463 
•6215 
•5953 
•5673 
•5376 

1-1400 
1-1165 
1-0899 
1-0593 
1-0247 

•91 
•92 
•93 
•94 
•95 

18-8761 
17-4739 
15-9844 
14-3928 
12-6762 

13-2849 
12-2814 
11-2175 
10-0831 
8-8626 

8-7055 
8-0324 
7-3207 
6-5641 
5-7529 

5-1379 
4-7269 
4-2941 
3-8359 
3-3470 

2-5821 
2-3650 
2-1376 
1-8982 
1-6448 

1-0381 
•9467 
•8512 
•7515 
•6465 

•5059 
•4718 
•4351 
•3954 
•3519 

•9855 
•9405 
•8891 
•8300 
•7612 

•96 
•97 
•98 
•99 

10-8011 
8-6423 
6-4503 
3-6335 

7-5333 
6-0092 
4-4660 
2-4665 

4-8728 
3-8696 
2-8579 
1-5250 

2-8195 
2-2236 
1-6261 

•9088 

1-3737 
1-0711 
•7705 
•4181 

•5350 
•4122 
•2913 
•1527 

•3036 
•2469 
•1881 
•1127 

•6795 
•5751 
•4613 
•2981 

APPENDIX  II 

A  LIST  OF  DEFINITE  INTEGRALS  OF  FREQUENT 
OCCURRENCE  IN  PROBABILITY  WORK 


A=  I    ze-**dz=%,          =  zero. 

./0  J  -oo 

*«fe  =  ^,  f  _Vw. 

^  J  -oo 


/•oo  -no 

D=  I    23e-^^=j}    |      =  zero. 

.'0  J  -ao 


,  1.8.5...  (»-, 

=       z»e-*  dz  =    -  -  (n  even) 


Or 


2.2.2.2...2 


^ 

J    „ 


G 


f* 

J 


2 
M  7T 


but  I     is 


f 
I 

a_  f  e-^^.dz  =  v-  -Q^E  but  r  j.^1. 

J  -oo  VV  .'0  ^ 

r00    r00     -(P&+2RXV+W) 
"6 

HPQ>RZ,  but 


J  o  J  o  4 

K  =         e  2<2z  = —  e~     Q     ,  but  I     is  not  -^ 

J  -oo  Q$  Jo  2 


r    I"0 
./-co  .'- 
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xydxdy  =  -         *R  _i ,  if  PQ  >  R\ 

/•«/•«  .          L 

but  is  not  -r 

Jo  Jo  4 


if  PO 

' 


NOTE.  In  the  above  integrals  P  and  (?  do  not,  as  often,  represent 
constants  whose  sum  is  unity.  R  is  not  a  correlation  coefficient,  although 
it  is  closely  connected  therewith. 
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NOTATION 

A  simple,  consistent,  and  easily  remembered  notation  is  an  essential 
in  any  mathematical  treatment.  The  chief  difficulty  in  the  way  of 
obtaining  consistency  in  a  work  like  the  present  is  the  desire  naturally 
felt  to  keep  any  piece  of  research  in  the  notation  used  by  its  originator. 
But  a  great  effort  has  been  made  to  attain  a  notation  which  could  be 
used  throughout  the  book  without  too  seriously  altering  the  appearance 
of  any  well-known  formula.  Most  of  the  symbols  in  the  following 
Table  have  been  used  by  several  important  writers  with  the  meaning 
here  indicated.  They  are  either  always  used  with  this  meaning:  or  if 
there  be  any  exception  it  is  hoped  that  the  context  will  make  clear  what 
is  meant. 

The  greatest  difficulty  has  been  felt  over  the  choice  of  a  symbol 
for  a  stimulus  value.  The  commonest  symbols  hitherto  used  have  been 
R  and  S  for  stimulus  and  sensation  intensity  respectively,  and  r  has 
been  used  instead  of  R  by  at  least  one  very  important  writer.  But 
there  is  still  greater  unanimity  of  usage  in  employing  r  and  R  for  correla- 
tion coefficients.  In  adopting  small  s  for  stimulus,  we  realise  that  there 
is  some  danger  of  confusion  with  great  S  used  in  the  past,  by  writers 
on  Weber's  Law,  for  sensation  intensity.  But  some  of  the  same  writers 
also  use  S  for  a  standard  stimulus  (as  we  do) :  and  after  the  first  chapter 
we  have  no  necessity  to  speak  of  sensation-intensities  at  all,  it  is  frankly 
physical  stimuli  which  are  measured.  In  the  first  chapter  itself  ambiguity 
or  misunderstanding  is  avoided  by  printing  the  words  at  full  length. 

It  is  hoped  that  this  list  of  symbols  may  assist  in  obtaining  unanimity 
and  consistency  in  the  notation  of  the  subject  and  serve  as  a  convenient 
reference. 
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SYMBOLS  USED 

Symbol  Meaning 

a  a  mean,  w  is  also  used  for  a  mean,  and  any  quantity  with  a  bar,  as  x,  is  a  mean. 

A  a  true  value. 

«i»  «2>  °*     readings  of  which  a  is  the  mean. 

6  a  coefficient  of  regression.   Usually  suffixes  are  added. 

c  and  C    constants,   k  is  also  used  in  this  sense. 

Cj,  Cz,  and  0    contingency  symbols. 

5  a  deviation:  also  used  for  a  finite  increment,  d  indicating  an  infinitesimal 

increment. 
e  an  error. 

h  modulus  of  precision  in  one  notation  of  the  Gaussian  or  Normal  Curve  of  Error. 

k  a  constant.  Also  c  and  C.  See  also  n. 

I  see  n. 

m  a  mean.   See  also  n. 

M  Muller's  weights.  See  w. 

H  and  v    moments. 
N  see  n. 

n  a  number  (of  experiments,  or  the  like).  Where  necessary,  N,  and  also  k,  I,  m, 

are  used  in  this  sense,   m  more  commonly  is  used  for  a  mean. 
p  a  mathematical  measure  of  probability. 

P  same  as  p. 

Q  same  as  p.   Usually  Q  is  1  -P:  also  used,  usually  with  a  suffix,  for  a  quartile. 

q  same  as  p.    Usually  q  is  1  —p. 

r  a  correlation  coefficient. 

R  a  correlation  coefficient. 

77  a  correlation  ratio. 

*  a  stimulus  value.  S  is  a  standard  stimulus,  F  a  variable,  a  is  also  on  one  occasion 

used  for  a  standard  deviation. 

<r  a  standard  deviation.  Also  2  is  used,  and  on  one  occasion  s. 

S  a  standard  deviation.  Would  also  be  used  for  a  summation  of  any  kind  other 

than  that  for  which  8  is  used  in  this  book;  but  no  such  occasion  arises  in 

the  present  work. 
S  a  sum  over  the  distribution,  e.g.  S  (e)  is  the  sum  of  the  errors.  8  is  also  used  for 

a  standard  stimulus,  see  small  s  and  also  the  remarks  preceding  this  table. 
t  t,  with  x,  y  and  z,  is  used  as  a  variable  of  integration. 

T  Thresholds,  both  absolute  and  difference.    Sometimes  with  suffixes  as  Tv  and 

Tl  for  upper  and  lower  difference  thresholds  or  limina. 
u  u  and  v  "residuals"  in  observation  equations. 

v  see  u. 

V  variable  stimulus,  see  s.  Also  coefficient  of  variation. 

to  "weight"  of  a  reading  or  an  equation.   Also  W,  and  M,  the  latter  for  Muller's 

weights  only. 
W  see  w. 

X  in  connection  with  goodness  of  fit. 

x  x,  yt  z  and  t  are  used  as  variables  of  integration. 

y  see  a. 

z  see  x. 
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